A NEW MEMBER OF THE PAK PROTEIN FAMILY, NUCLEIC ACIDS AND
METHODS RELATED TO THE SAME

FIELD OF THE INVENTION 

The present invention is directed, in part, to nucleic acid molecules encoding
p21-activated kinase 5, novel polypeptides, and assays for screening compounds which
bind to p21-activated kinase 5 and/or modulate the activity of p21-activated kinase 5.

BACKGROUND OF THE INVENTION 

The p21-activated kinase (PAK) family of serine-threonine kinases
(reviewed by Bagrodia and Cerione, Trends in Cell Biology, vol. 9, pp. 350-355, 1999, and
references contained therein) currently consists of four members, PAK1 (also known as
PAKα), PAK2 (also known as PAKγ), PAK3 (also known as PAKβ), and PAK4. The
kinase activity of PAKs is stimulated by binding to the GTP-bound forms of Cdc42 and
p21Rac (hereafter referred to as Rac). The C-terminal region of PAK proteins contains the
kinase catalytic domain, which shows the highest sequence conservation
between different members of the family. The N-terminal region contains a conserved
motif thought to be responsible for binding to Cdc42 and Rac GTPases (the 'GBD/CRIB'
motif, Burbelo et al., Journal of Biological Chemistry, vol. 270, pp. 29071-29074, 1995).
PAKs also contain in their N-terminal domains several copies of the PXXP protein motif
that represents a binding site for SH3 protein domains. It has been shown that PAKs 1-3
possess an N-terminal regulatory region (overlapping with the Cdc42/Rac binding domain)
that is responsible for maintaining the kinase in a catalytically inactive form. PAK proteins
have recently been shown to utilize sequences within the N-terminal domain for
high-affinity binding to two SH3 domain-containing proteins, p85Cool-1/βPix and αPix/Cool-2
(Manser et al., Mol. Cell, vol. 1, pp. 183-192, 1998, and Bagrodia et al., J. Biol. Chem., vol.
273, pp. 23633-23636, 1998). p85Cool-1/βPix localizes to peripheral focal complexes, and
was found to recruit PAK1 from the cytoplasm to these complexes, while an alternatively
spliced version of p85Cool-1/βPix, p50Cool-1, appears to bind PAK3 and inhibit its
kinase activity. Cool-2/αPix stimulates PAK activity through an as yet unclear mechanism.
Two tyrosine phosphorylated proteins, termed Cat-1 and Cat-2 (Cool-associated tyrosine
phosphorylated proteins 1 and 2, Bagrodia et al., J Biol Chem, vol. 274, pp. 22393-400,
1999) have recently been found to interact with p85Cool-1/βPix and Cool-2/αPix, but not
with p50Cool-1. It therefore appears likely that Cat-1 and Cat-2 play crucial roles in PAK
regulation, since they only interact with forms of Cool/Pix that promote PAK activity.

In addition to these interactions, PAK proteins have been shown to be
recruited to activated tyrosine kinase receptors by the SH2/SH3 adapter protein Nck
(Bokoch et al., J Biol Chem, vol. 271, pp. 25746-9, 1996). This recruitment may provide a
link between cell activation by growth factor receptors and PAK signaling pathways. It has
also been shown that PAK kinase activity can be stimulated in the absence of Cdc42 or Rac
binding by sphingosine and other membrane lipids (Bokoch et al., J Biol Chem, vol. 273,
pp. 8137-44, 1998), but repressed by products of sphingolipid metabolism (Lian et al., J
Immunol, vol. 161, pp. 4375-81, 1998).

The downstream consequences of PAK activity are also multifold and
complex. PAK proteins have been found to affect assembly of focal contacts, cytoskeletal
organization, neurite outgrowth, lamellipodia formation, membrane ruffling, and regulation of
cell motility and morphology. PAKs have also been found to activate nuclear
mitogen-activated protein kinases (MAPKs), and importantly, to phosphorylate the kinase Raf1, a
downstream effector of Ras proteins: in fact, kinase-defective PAK mutants revert the
oncogenic activity of mutated Ras (Tang et al., Proc Natl Acad Sci USA, vol. 95, pp.
5139-44, 1998). Additionally, PAKs become activated after stimulation of the T-cell receptor,
and are required for activation of ERK2 and the NFAT transcription factor, and
consequently gene expression by the T-cell receptor (Yablonski et al., EMBO J, vol. 17,
pp. 5647-57, 1998).

In summary, PAK proteins are subject to diverse regulatory inputs, and 
transmit signals to diverse downstream effectors which are essential for many signaling
pathways that are fundamental for cell morphology, motility/migration, proliferation, 
differentiation or cell death. 

The present invention involves the surprising discovery of a novel
polypeptide, herein designated p21-activated kinase 5 (henceforth referred to as "PAK5"),
and its role as a key component, for example, in regulating cell proliferation, cell
migration, cell differentiation, cytoskeletal organisation, gene expression, cell cycle
progression, and cell death. PAK5 is, thus, useful in the search for novel agents that can
modify and/or control these processes. These and other aspects of the invention are
described below.

SUMMARY OF THE INVENTION 

The present invention is directed, in part, to isolated nucleic acid molecules
comprising SEQ ID NO:1 or a fragment thereof; SEQ ID NO:2 or a fragment thereof; a
nucleotide sequence complementary to at least a portion of SEQ ID NO:1 or SEQ ID NO:2;
a nucleotide sequence homologous to SEQ ID NO:1 or SEQ ID NO:2 or a fragment
thereof; a nucleotide sequence that encodes a polypeptide comprising SEQ ID NO:3 or a
fragment thereof; or a nucleotide sequence that encodes a polypeptide comprising an amino
acid sequence homologous to SEQ ID NO:3, or a fragment thereof.

The present invention is also directed to recombinant expression vectors 
comprising any of the nucleic acid molecules described above. 

The present invention is also directed to host cells transformed with a 
recombinant expression vector comprising any of the nucleic acid molecules described 
above.

The present invention is also directed to methods of producing a polypeptide 
comprising SEQ ID NO:3, or a homolog or fragment thereof, by introducing a recombinant 
expression vector comprising any of the nucleic acid molecules described above into a 
compatible host cell, growing the host cell under conditions suitable for expression of the 
polypeptide, and recovering the polypeptide from the host cell.

The present invention is also directed to compositions comprising any of the 
nucleic acid molecules described above and an acceptable carrier or diluent. 

The present invention is also directed to isolated polypeptides encoded by 
any of the nucleic acid molecules described above. 

The present invention is also directed to compositions comprising a
polypeptide encoded by any of the nucleic acid molecules described above and an
acceptable carrier or diluent.

The present invention is also directed to isolated antibodies which bind to an 
epitope on a polypeptide encoded by any of the nucleic acid molecules described above. 

The present invention is also directed to kits comprising antibodies which
bind to a polypeptide encoded by any of the nucleic acid molecules described above and a
negative control antibody.

The present invention is also directed to methods of inducing an immune 
response in a mammal against a polypeptide encoded by any of the nucleic acid molecules
described above by administering to the mammal an amount of the polypeptide sufficient to 
induce the immune response. 

The present invention is also directed to methods of identifying a compound 
that binds to PAK5 by contacting PAK5 with a compound, and determining whether the
compound binds PAK5. 

The present invention is also directed to methods of identifying a compound 
that binds a nucleic acid molecule encoding PAK5 by contacting PAK5 with a compound, 
and determining whether the compound binds the nucleic acid molecule. 

The present invention is also directed to methods of identifying a compound
that modulates the activity of PAK5 by contacting PAK5 with a compound, and
determining whether PAK5 activity is modified.

The present invention is also directed to compounds that modulate PAK5 
activity identified by contacting PAK5 with the compound, and determining whether the 
compound modifies activity of PAK5, binds to PAK5, or binds to a nucleic acid molecule
encoding PAK5. 

These and other aspects of the invention are described in greater detail
below.

DETAILED DESCRIPTION OF PREFERRED EMBODIMENTS

The present invention provides, inter alia, isolated and purified 
polynucleotides that encode PAK5 or a portion thereof, vectors containing these 
polynucleotides, host cells transformed with these vectors, processes of making PAK5, 
methods of using the above polynucleotides and vectors, isolated and purified PAK5 and 
methods of screening compounds which modulate PAK5 activity.

Various definitions are made throughout this document. Most words have 
the meaning that would be attributed to those words by one skilled in the art. Words 
specifically defined either below or elsewhere in this document have the meaning provided 
in the context of the present invention as a whole and as are typically understood by those 
skilled in the art.

As used herein, the term "activity" refers to a variety of measurable indicia
suggesting or revealing binding, either direct or indirect; affecting a response, i.e. having a
measurable effect in response to some exposure or stimulus, including, for example, the
affinity of a compound for directly binding a polypeptide or polynucleotide of the
invention, or, for example, measurement of amounts of upstream or downstream proteins or
other similar functions after some stimulus or event.

As used herein, the term "kinase activity" refers to the ability of the protein 
of the present invention to transfer the γ-phosphate of a purine nucleotide triphosphate to
the hydroxyl groups of protein substrates. 

As used herein, the abbreviations in lower case, pak5, refer to a gene, 
cDNA, RNA or nucleic acid sequence while the upper case version, PAK5, refers to a 
protein, polypeptide, peptide, oligopeptide, or amino acid sequence. 

As used herein, the term "antibody" is meant to refer to complete, intact
antibodies, and Fab fragments and F(ab)2 fragments thereof. Complete, intact antibodies
include monoclonal antibodies such as murine monoclonal antibodies, chimeric antibodies
and humanized antibodies.

As used herein, the term "binding" means the physical or chemical
interaction between two proteins or compounds or associated proteins or compounds or
combinations thereof. Binding includes ionic, non-ionic, hydrogen-bond, van der Waals,
and hydrophobic interactions, etc. The physical interaction, the binding, can be either direct or
indirect, indirect being through or due to the effects of another protein or compound.
Direct binding refers to interactions that do not take place through or due to the effect of
another protein or compound but instead are without other substantial chemical
intermediates.

As used herein, the term "compound" means any identifiable chemical or 
molecule, including, but not limited to, small molecule, peptide, protein, sugar, nucleotide,
or nucleic acid, and such compound can be natural or synthetic. 

As used herein, the term "complementary" refers to Watson-Crick
base pairing between nucleotide units of a nucleic acid molecule.
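
As an illustration of Watson-Crick complementarity, the following is a minimal Python sketch. The function names and the short test sequence are hypothetical and are not taken from SEQ ID NO:1 or SEQ ID NO:2; the mapping handles only the four unambiguous DNA bases.

```python
# Watson-Crick pairing for DNA; only A, C, G and T are handled in this sketch.
PAIRS = {"A": "T", "T": "A", "G": "C", "C": "G"}


def reverse_complement(seq: str) -> str:
    """Return the complementary strand, written in the 5' to 3' direction."""
    return "".join(PAIRS[base] for base in reversed(seq.upper()))


def is_complementary(a: str, b: str) -> bool:
    """True when strand b base-pairs with strand a along its full length."""
    return reverse_complement(a) == b.upper()
```

For example, `reverse_complement("ATGC")` yields `"GCAT"`, so `is_complementary("ATGC", "GCAT")` is true.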

As used herein, the term "contacting" means bringing together, either
directly or indirectly, a compound into physical proximity to a polypeptide or
polynucleotide of the invention. The polypeptide or polynucleotide can be in any number
of buffers, salts, solutions etc. Contacting includes, for example, placing the compound
into a beaker, microtiter plate, cell culture flask, or a microarray, such as a gene chip, or the
like, which contains the PAK5 polypeptide, the nucleic acid molecule encoding PAK5, or a
fragment thereof.

As used herein, the phrase "homologous nucleotide sequence," or
"homologous amino acid sequence," or variations thereof, refers to sequences characterized
by a homology, at the nucleotide level or amino acid level, of at least about 60%, more
preferably at least about 70%, more preferably at least about 80%, more preferably at least
about 90%, and most preferably at least about 95% to the entire SEQ ID NO:1 or SEQ ID
NO:2, or to at least a portion of SEQ ID NO:1 or SEQ ID NO:2 which encodes a functional
domain of the encoded polypeptide, or to SEQ ID NO:3. Homologous nucleotide
sequences include those sequences coding for isoforms of PAK5 proteins. Such isoforms
can be expressed in different tissues of the same organism as a result of, for example,
alternative splicing of RNA. Alternatively, isoforms can be encoded by different genes.
Homologous nucleotide sequences include nucleotide sequences encoding PAK5
proteins of species other than humans, including, but not limited to, mammals.
Homologous nucleotide sequences also include, but are not limited to, naturally occurring
allelic variations and mutations of the nucleotide sequences set forth herein. A
homologous nucleotide sequence does not, however, include the nucleotide sequences
encoding human PAK1 (SEQ ID NO:4), PAK2 (SEQ ID NO:5), PAK3 (SEQ ID NO:6) or
PAK4 (SEQ ID NO:7). Homologous amino acid sequences include those amino acid
sequences which contain conservative amino acid substitutions in SEQ ID NO:3, as well as
polypeptides having PAK5-like kinase activity, or binding activities characteristic of
PAK5. Percent homology can be determined by, for example, the Gap program (Wisconsin
Sequence Analysis Package, Version 8 for Unix, Genetics Computer Group, University
Research Park, Madison WI), using the default settings, which uses the algorithm of Smith
and Waterman (Adv. Appl. Math, 1981, 2, 482-489, which is incorporated herein by
reference in its entirety). Homology of the present invention to related known molecules is
discussed below.
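
To make the notion of percent homology more concrete, the following is a minimal Python sketch of a Smith-Waterman local alignment followed by a percent-identity calculation. It is not the Gap program and does not reproduce its default settings; the scoring values (match 2, mismatch -1, linear gap -2), the function names, and the short test sequences are illustrative assumptions only.

```python
def smith_waterman(a: str, b: str, match: int = 2, mismatch: int = -1, gap: int = -2):
    """Local alignment with a linear gap penalty.

    Returns (best_score, aligned_a, aligned_b). Scoring values are assumptions.
    """
    rows, cols = len(a) + 1, len(b) + 1
    h = [[0] * cols for _ in range(rows)]
    best, best_pos = 0, (0, 0)
    for i in range(1, rows):
        for j in range(1, cols):
            diag = h[i - 1][j - 1] + (match if a[i - 1] == b[j - 1] else mismatch)
            h[i][j] = max(0, diag, h[i - 1][j] + gap, h[i][j - 1] + gap)
            if h[i][j] > best:
                best, best_pos = h[i][j], (i, j)
    # Trace back from the highest-scoring cell until a zero cell is reached.
    i, j = best_pos
    out_a, out_b = [], []
    while i > 0 and j > 0 and h[i][j] > 0:
        step = match if a[i - 1] == b[j - 1] else mismatch
        if h[i][j] == h[i - 1][j - 1] + step:
            out_a.append(a[i - 1])
            out_b.append(b[j - 1])
            i, j = i - 1, j - 1
        elif h[i][j] == h[i - 1][j] + gap:
            out_a.append(a[i - 1])
            out_b.append("-")
            i -= 1
        else:
            out_a.append("-")
            out_b.append(b[j - 1])
            j -= 1
    return best, "".join(reversed(out_a)), "".join(reversed(out_b))


def percent_identity(aligned_a: str, aligned_b: str) -> float:
    """Identical positions as a percentage of alignment length; gaps count as mismatches."""
    if not aligned_a:
        return 0.0
    same = sum(x == y and x != "-" for x, y in zip(aligned_a, aligned_b))
    return 100.0 * same / len(aligned_a)
```

A result from `percent_identity` could then be compared against the thresholds recited above (about 60%, 70%, 80%, 90% or 95%). Note that identity over a local alignment is only one possible reading of "homology"; a production tool would normally use a substitution matrix and affine gap penalties.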

As used herein, the term "isolated" nucleic acid molecule refers to a nucleic 
acid molecule (DNA or RNA) that has been removed from its native environment. 
Examples of isolated nucleic acid molecules include, but are not limited to, recombinant 
DNA molecules contained in a vector, recombinant DNA molecules maintained in a 
heterologous host cell, partially or substantially purified nucleic acid molecules, and 
synthetic DNA or RNA molecules. 

As used herein, the terms "modulates" or "modifies" mean an increase or
decrease in the amount, quality, or effect of a particular activity or protein. 

As used herein, the term "oligonucleotide" refers to a series of linked
nucleotide residues which has a sufficient number of bases to be used in a polymerase
chain reaction (PCR). This short sequence is based on (or designed from) a genomic or
cDNA sequence and is used to amplify, confirm, or reveal the presence of an identical,
similar or complementary DNA or RNA in a particular cell or tissue. Oligonucleotides
comprise portions of a DNA sequence having at least about 10 nucleotides and as many as
about 50 nucleotides, preferably about 15 to 30 nucleotides. They are chemically
synthesized and may be used as probes.

As used herein, the term "probe" refers to nucleic acid sequences of variable
length, preferably between at least about 10 and as many as about 6,000 nucleotides,
depending on use. They are used in the detection of identical, similar, or complementary
nucleic acid sequences. Longer length probes are usually obtained from a natural or
recombinant source, are highly specific and much slower to hybridize than oligomers. They
may be single- or double-stranded and carefully designed to have specificity in PCR,
membrane-based hybridization, or ELISA-like technologies.

As used herein, the phrase "stringent hybridization conditions" or "stringent
conditions" refers to conditions under which a probe, primer, or oligonucleotide will
hybridize to its target sequence, but to no other sequences. Stringent conditions are
sequence-dependent and will be different in different circumstances. Longer sequences
hybridize specifically at higher temperatures. Generally, stringent conditions are selected
to be about 5°C lower than the thermal melting point (Tm) for the specific sequence at a
defined ionic strength and pH. The Tm is the temperature (under defined ionic strength, pH
and nucleic acid concentration) at which 50% of the probes complementary to the target
sequence hybridize to the target sequence at equilibrium. Since the target sequences are
generally present in excess, at Tm, 50% of the probes are occupied at equilibrium.
Typically, stringent conditions will be those in which the salt concentration is less than
about 1.0 M sodium ion, typically about 0.01 to 1.0 M sodium ion (or other salts) at pH 7.0
to 8.3 and the temperature is at least about 30°C for short probes, primers or
oligonucleotides (e.g. 10 to 50 nucleotides) and at least about 60°C for longer probes,
primers or oligonucleotides. Stringent conditions may also be achieved with the addition
of destabilizing agents, such as formamide.
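
The relationship between Tm and a stringent hybridization temperature can be sketched in Python as follows. The text above does not give a formula for Tm; the sketch assumes the widely used rule of thumb for short oligonucleotides, Tm = 2(A+T) + 4(G+C) in degrees Celsius, which is only a rough approximation and ignores ionic strength, pH, and probe concentration. The function names and the 16-mer are hypothetical.

```python
def wallace_tm(oligo: str) -> float:
    """Rule-of-thumb Tm in degrees C for a short oligo: 2*(A+T) + 4*(G+C).

    This approximation is an assumption of the sketch, not a formula from the text.
    """
    s = oligo.upper()
    at = s.count("A") + s.count("T")
    gc = s.count("G") + s.count("C")
    return 2.0 * at + 4.0 * gc


def stringent_temperature(oligo: str, offset: float = 5.0) -> float:
    """Hybridization temperature about `offset` degrees C below the estimated Tm."""
    return wallace_tm(oligo) - offset
```

For the hypothetical 16-mer `ATGCATGCATGCATGC` (eight A/T and eight G/C bases), the estimate is a Tm of 48°C and a stringent temperature of about 43°C.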

The amino acid sequences are presented in the amino to carboxy direction, 
from left to right. The amino and carboxy groups are not presented in the sequence. The 
nucleotide sequences are presented by single strand only, in the 5' to 3' direction, from left
to right. Nucleotides and amino acids are represented in the manner recommended by the 
IUPAC-IUB Biochemical Nomenclature Commission or (for amino acids) by three letter 
codes. 

One aspect of the present invention is directed to nucleic acid molecules
comprising novel nucleotide sequences encoding PAK5. The nucleic acid molecules are
preferably either RNA or DNA, but may contain both RNA and DNA monomers or peptide
nucleic acid monomers. The nucleic acid molecule may be single stranded or double
stranded. The monomers of the nucleic acid molecules may be linked via conventional
phosphodiester bonds or modified bonds, such as, for example, phosphorothioate bonds
and the like. In addition, the sugar moieties of the monomers may be modified by, for
example, addition of 2' substitutions which help confer nuclease resistance and/or cellular
uptake.

In a preferred embodiment of the invention, the nucleic acid molecule
comprises SEQ ID NO:1, which is 2511 bases in length and comprises an open reading
frame (ORF) of approximately 2157 nucleotides (from about position 352 to about position
2508 within SEQ ID NO:1) which encodes PAK5. Alternatively, the nucleic acid molecule
comprises a fragment of SEQ ID NO:1. Preferably, the fragment comprises from about 10
to about 100 nucleotides, from about 101 to about 200 nucleotides, from about 201 to about
300 nucleotides, from about 301 to about 400 nucleotides, from about 401 to about 500
nucleotides, from about 501 to about 600 nucleotides, from about 601 to about 700
nucleotides, from about 701 to about 800 nucleotides, from about 801 to about 900
nucleotides, from about 901 to about 1000 nucleotides, from about 1001 to about 1100
nucleotides, from about 1101 to about 1200 nucleotides, from about 1201 to about 1300
nucleotides, from about 1301 to about 1400 nucleotides, from about 1401 to about 1500
nucleotides, from about 1501 to about 1600 nucleotides, from about 1601 to about 1700
nucleotides, from about 1701 to about 1800 nucleotides, from about 1801 to about 1900
nucleotides, from about 1901 to about 2000 nucleotides, from about 2001 to about 2100
nucleotides, from about 2101 to about 2200 nucleotides, from about 2201 to about 2300
nucleotides, from about 2301 to about 2400 nucleotides, from about 2401 to about 2500
nucleotides, from about 2501 to about 2511 nucleotides, and any combinations thereof. The fragment
can be located within any portion of SEQ ID NO:1. The invention therefore provides
fragments of PAK5 which comprise at least 14 and preferably at least 16, 18, 20, 25, 50, or
75 consecutive nucleotides.

WO 01/36602    PCT/EP00/10736

In another preferred embodiment of the invention, the nucleic acid molecule
comprises SEQ ID NO:2, which is 2157 bases in length and comprises the ORF (from
about position 352 to about position 2508 within SEQ ID NO:1) described above.
Alternatively, the nucleic acid molecule comprises a fragment of SEQ ID NO:2.
Preferably, the fragment comprises from about 10 to about 100 nucleotides, from about 101
to about 200 nucleotides, from about 201 to about 300 nucleotides, from about 301 to about
400 nucleotides, from about 401 to about 500 nucleotides, from about 501 to about 600
nucleotides, from about 601 to about 700 nucleotides, from about 701 to about 800
nucleotides, from about 801 to about 900 nucleotides, from about 901 to about 1000
nucleotides, from about 1001 to about 1100 nucleotides, from about 1101 to about 1200
nucleotides, from about 1201 to about 1300 nucleotides, from about 1301 to about 1400
nucleotides, from about 1401 to about 1500 nucleotides, from about 1501 to about 1600
nucleotides, from about 1601 to about 1700 nucleotides, from about 1701 to about 1800
nucleotides, from about 1801 to about 1900 nucleotides, from about 1901 to about 2000
nucleotides, from about 2001 to about 2100 nucleotides, from about 2101 to about 2157
nucleotides, and any combinations thereof. The fragment can be located within any portion
of SEQ ID NO:2.
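
The ORF coordinates recited above can be checked with a short calculation. The following Python sketch assumes the positions are 1-based and inclusive, which is the reading under which positions 352 to 2508 give exactly 2157 nucleotides; the function and constant names are hypothetical. Whether the resulting codon count includes a stop codon is not stated in the text and is left open here.

```python
ORF_START = 352   # first ORF position within SEQ ID NO:1 (assumed 1-based)
ORF_END = 2508    # last ORF position within SEQ ID NO:1 (assumed inclusive)


def orf_length(start: int, end: int) -> int:
    """Length of an inclusive, 1-based interval."""
    return end - start + 1


def codon_count(length: int) -> int:
    """Number of complete codons; raises if the length is not a multiple of three."""
    if length % 3:
        raise ValueError("ORF length is not a multiple of three")
    return length // 3


def extract_orf(seq: str, start: int, end: int) -> str:
    """Slice a 1-based inclusive interval out of a 0-based Python string."""
    return seq[start - 1:end]
```

With these coordinates, `orf_length(ORF_START, ORF_END)` is 2157, matching the stated length of SEQ ID NO:2, and `codon_count(2157)` is 719.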

In another preferred embodiment of the invention, the nucleic acid molecule
comprises a nucleotide sequence complementary to at least a portion of SEQ ID NO:1 or
SEQ ID NO:2. Preferably, the nucleic acid molecule comprises a nucleotide sequence
complementary to the entire sequence recited in SEQ ID NO:1 or SEQ ID NO:2.
Alternatively, the nucleic acid molecule comprises a nucleotide sequence complementary
to a portion of SEQ ID NO:1 or SEQ ID NO:2 (i.e., complementary to any of the fragments
described above). Nucleotide sequences complementary to at least a portion of SEQ ID
NO:1 or SEQ ID NO:2 include, for example, oligonucleotides which hybridize under
stringent hybridization conditions to at least a portion of SEQ ID NO:1 or SEQ ID NO:2.
Preferred oligonucleotides comprise at least about 10 nucleotides and as many as about 50
nucleotides, preferably about 15 to 30 nucleotides. They are chemically synthesized and
can be used as probes, primers, and as antisense agents.

In another preferred embodiment of the invention, the nucleic acid molecule
comprises a nucleotide sequence homologous to SEQ ID NO:1 or SEQ ID NO:2.
Preferably, the nucleotide sequence is at least about 60% homologous, more preferably at
least about 70% homologous, more preferably at least about 80% homologous, more
preferably at least about 90% homologous, and most preferably at least about 95%
homologous to the entire SEQ ID NO:1 or SEQ ID NO:2. Alternatively, the nucleotide
sequence is at least about 60% homologous, more preferably at least about 70%
homologous, more preferably at least about 80% homologous, more preferably at least
about 90% homologous, and most preferably at least about 95% homologous to a portion of
SEQ ID NO:1 or SEQ ID NO:2 which encodes a functional domain of the polypeptide
encoded thereby. In addition, a nucleotide sequence homologous to SEQ ID NO:1 or SEQ
ID NO:2 also includes a fragment of the nucleotide sequence homologous to SEQ ID NO:1
or SEQ ID NO:2 of the lengths described above.

In another preferred embodiment of the invention, the nucleic acid molecule
comprises a nucleotide sequence that encodes a polypeptide comprising SEQ ID NO:3.
The nucleic acid molecule preferably comprises SEQ ID NO:2 or comprises SEQ ID NO:2
containing codon substitutions which reflect the degeneracy of the genetic code. As is well
known in the art, because of the degeneracy of the genetic code, there are numerous other
DNA and RNA molecules that can code for the same polypeptide as that encoded by SEQ
ID NO:2. The present invention, therefore, contemplates these other DNA and RNA
molecules which, on expression, encode the polypeptide of SEQ ID NO:3. DNA and RNA
molecules other than those specifically disclosed herein, characterized simply by a change
in a codon for a particular amino acid, are within the scope of the present invention.

As is well known in the art, because of the degeneracy of the genetic code,
there are numerous other DNA and RNA molecules that can code for the same polypeptide
as that encoded by the aforementioned pak5 gene. The present invention, therefore,
contemplates those other DNA and RNA molecules which, on expression, encode the
polypeptide of SEQ ID NO:3. Having identified the amino acid residue sequence encoded
by a pak5 gene, and with knowledge of all triplet codons for each particular amino acid
residue, it is possible to describe all such encoding RNA and DNA sequences. DNA and
RNA molecules other than those specifically disclosed herein, characterized simply by a
change in a codon for a particular amino acid, are within the scope of this invention.

A table of amino acids and their representative abbreviations, symbols and
codons is set forth below in Table 1.

Table 1

Amino acid       Abbrev.   Symbol   Codon(s)
Alanine          Ala       A        GCA GCC GCG GCU
Cysteine         Cys       C        UGC UGU
Aspartic acid    Asp       D        GAC GAU
Glutamic acid    Glu       E        GAA GAG
Phenylalanine    Phe       F        UUC UUU
Glycine          Gly       G        GGA GGC GGG GGU
Histidine        His       H        CAC CAU
Isoleucine       Ile       I        AUA AUC AUU
Lysine           Lys       K        AAA AAG
Leucine          Leu       L        UUA UUG CUA CUC CUG CUU
Methionine       Met       M        AUG
Asparagine       Asn       N        AAC AAU
Proline          Pro       P        CCA CCC CCG CCU
Glutamine        Gln       Q        CAA CAG
Arginine         Arg       R        AGA AGG CGA CGC CGG CGU
Serine           Ser       S        AGC AGU UCA UCC UCG UCU
Threonine        Thr       T        ACA ACC ACG ACU
Valine           Val       V        GUA GUC GUG GUU
Tryptophan       Trp       W        UGG
Tyrosine         Tyr       Y        UAC UAU

As is well known in the art, codons constitute triplet sequences of
nucleotides in mRNA molecules and, as such, are characterized by the base uracil (U) in
place of the base thymine (T) (which is present in DNA molecules). A simple change in a
codon for the same amino acid residue within a polynucleotide will not change the
sequence or structure of the encoded polypeptide.
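
The degeneracy described above can be illustrated with a short Python sketch. The codon lists follow the standard genetic code as tabulated in Table 1 (cysteine being encoded by UGC and UGU); the function names and the short peptides used in the examples are hypothetical and are not drawn from SEQ ID NO:3.

```python
from math import prod

# mRNA codons for each amino acid (one-letter symbols), standard genetic code.
CODONS = {
    "A": ["GCA", "GCC", "GCG", "GCU"],
    "C": ["UGC", "UGU"],
    "D": ["GAC", "GAU"],
    "E": ["GAA", "GAG"],
    "F": ["UUC", "UUU"],
    "G": ["GGA", "GGC", "GGG", "GGU"],
    "H": ["CAC", "CAU"],
    "I": ["AUA", "AUC", "AUU"],
    "K": ["AAA", "AAG"],
    "L": ["UUA", "UUG", "CUA", "CUC", "CUG", "CUU"],
    "M": ["AUG"],
    "N": ["AAC", "AAU"],
    "P": ["CCA", "CCC", "CCG", "CCU"],
    "Q": ["CAA", "CAG"],
    "R": ["AGA", "AGG", "CGA", "CGC", "CGG", "CGU"],
    "S": ["AGC", "AGU", "UCA", "UCC", "UCG", "UCU"],
    "T": ["ACA", "ACC", "ACG", "ACU"],
    "V": ["GUA", "GUC", "GUG", "GUU"],
    "W": ["UGG"],
    "Y": ["UAC", "UAU"],
}

# Reverse lookup: codon -> amino acid symbol (61 sense codons).
AMINO_ACID = {codon: aa for aa, codons in CODONS.items() for codon in codons}


def count_encodings(peptide: str) -> int:
    """Number of distinct mRNA sequences that encode the given peptide."""
    return prod(len(CODONS[aa]) for aa in peptide.upper())


def back_translate(peptide: str) -> str:
    """One possible mRNA: the first listed codon for each residue."""
    return "".join(CODONS[aa][0] for aa in peptide.upper())


def translate(mrna: str) -> str:
    """Translate an mRNA of sense codons, read in frame from the first base."""
    return "".join(AMINO_ACID[mrna[i:i + 3]] for i in range(0, len(mrna) - 2, 3))
```

For instance, `count_encodings("LSR")` is 6 x 6 x 6 = 216, while `count_encodings("MW")` is 1 because methionine and tryptophan each have a single codon. The mRNAs `GCAGCC` and `GCGGCU` both translate to `AA`, showing that a codon change for the same residue leaves the encoded sequence unchanged.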

Alternatively, the nucleic acid molecule comprises a nucleotide sequence
that encodes a fragment of the polypeptide comprising SEQ ID NO:3. Preferably, the
fragment comprises from about 5 to about 20 amino acids, from about 21 to about 40
amino acids, from about 41 to about 60 amino acids, from about 61 to about 80 amino
acids, from about 81 to about 100 amino acids, from about 101 to about 120 amino acids,
from about 121 to about 140 amino acids, from about 141 to about 160 amino acids, from
about 161 to about 180 amino acids, from about 181 to about 200 amino acids, from about
201 to about 220 amino acids, from about 221 to about 240 amino acids, from about 241 to
about 260 amino acids, from about 261 to about 280 amino acids, from about 281 to about
300 amino acids, from about 301 to about 320 amino acids, from about 321 to about 340
amino acids, from about 341 to about 360 amino acids, from about 361 to about 380 amino
acids, from about 381 to about 400 amino acids, from about 401 to about 420 amino acids,
from about 421 to about 440 amino acids, from about 441 to about 460 amino acids, from
about 461 to about 480 amino acids, from about 481 to about 500 amino acids, from about
501 to about 520 amino acids, from about 521 to about 540 amino acids, from about 541 to
about 560 amino acids, from about 561 to about 580 amino acids, from about 581 to about
600 amino acids, from about 601 to about 620 amino acids, from about 621 to about 640
amino acids, from about 641 to about 660 amino acids, from about 661 to about 680 amino
acids, from about 681 to about 700 amino acids, from about 701 to about 719 amino acids,
and any combinations thereof. The fragment can be located within any portion of SEQ ID
NO:3.

In another preferred embodiment of the invention, the nucleic acid molecule
comprises a nucleotide sequence that encodes a polypeptide comprising an amino acid
sequence homologous to SEQ ID NO:3. Alternatively, the nucleic acid molecule
comprises a nucleotide sequence that encodes a fragment of the polypeptide comprising an
amino acid sequence homologous to SEQ ID NO:3. Preferably, the fragment comprises
from about 5 to about 20 amino acids, from about 21 to about 40 amino acids, from about
41 to about 60 amino acids, from about 61 to about 80 amino acids, from about 81 to about
100 amino acids, from about 101 to about 120 amino acids, from about 121 to about 140
amino acids, from about 141 to about 160 amino acids, from about 161 to about 180 amino
acids, from about 181 to about 200 amino acids, from about 201 to about 220 amino acids,
from about 221 to about 240 amino acids, from about 241 to about 260 amino acids, from
about 261 to about 280 amino acids, from about 281 to about 300 amino acids, from about
301 to about 320 amino acids, from about 321 to about 340 amino acids, from about 341 to
about 360 amino acids, from about 361 to about 380 amino acids, from about 381 to about
400 amino acids, from about 401 to about 420 amino acids, from about 421 to about 440
amino acids, from about 441 to about 460 amino acids, from about 461 to about 480 amino
acids, from about 481 to about 500 amino acids, from about 501 to about 520 amino acids,
from about 521 to about 540 amino acids, from about 541 to about 560 amino acids, from
about 561 to about 580 amino acids, from about 581 to about 600 amino acids, from about
601 to about 620 amino acids, from about 621 to about 640 amino acids, from about 641 to
about 660 amino acids, from about 661 to about 680 amino acids, from about 681 to about
700 amino acids, from about 701 to about 719 amino acids, and any combinations thereof.
The fragment can be located within any portion of SEQ ID NO:3.

With the knowledge of the nucleotide sequence information disclosed in the
present invention, one skilled in the art can identify and obtain nucleotide sequences which
encode PAK5 from different sources (i.e., different tissues or different organisms) through
a variety of means well known to the skilled artisan and disclosed by, for example,
Sambrook et al., "Molecular cloning: a laboratory manual", Second Edition, Cold Spring
Harbor Press, Cold Spring Harbor, NY (1989), which is incorporated herein by reference in
its entirety.

For example, DNA which encodes PAK5 may be obtained by screening of
mRNA, cDNA, or genomic DNA with oligonucleotide probes generated from the PAK5
gene sequence information provided herein. Probes may be labeled with a detectable
group, such as a fluorescent group, a radioactive atom or a chemiluminescent group in
accordance with procedures known to the skilled artisan and used in conventional
hybridization assays, as described by, for example, Sambrook et al.

A nucleic acid molecule comprising any of the PAK5 nucleotide sequences
described above can alternatively be recovered by use of the polymerase chain reaction
(PCR) procedure, with the PCR oligonucleotide primers produced from the nucleotide
sequences provided herein. See U.S. Patent Numbers 4,683,195 to Mullis et al. and
4,683,202 to Mullis. The PCR reaction provides a method for selectively increasing the
concentration of a particular nucleic acid sequence even when that sequence has not been
previously purified and is present only in a single copy in a particular sample. The method
can be used to amplify either single- or double-stranded DNA. The essence of the method
involves the use of two oligonucleotide probes to serve as primers for the
template-dependent, polymerase-mediated replication of a desired nucleic acid molecule.
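
The amplification described above can be illustrated with a small piece of arithmetic. The following is a minimal sketch, assuming an idealized reaction in which every template is copied once per cycle; the function name and the efficiency parameter are hypothetical and are not part of the disclosure, and real reactions typically fall short of this ideal and plateau in later cycles.

```python
def copies_after_cycles(initial_copies, cycles, efficiency=1.0):
    """Idealized PCR yield.

    efficiency is the assumed fraction of templates copied in each cycle
    (1.0 means perfect doubling). This is an illustrative model only.
    """
    if not 0.0 <= efficiency <= 1.0:
        raise ValueError("efficiency must be between 0 and 1")
    return initial_copies * (1.0 + efficiency) ** cycles


# A single starting copy, 30 cycles, perfect doubling.
print(copies_after_cycles(1, 30))
```

Under these assumptions a single starting copy yields 2^30 copies after 30 cycles, which is why a sequence present only once in a sample can still be recovered.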

A wide variety of alternative cloning and in vitro amplification
methodologies are well known to those skilled in the art. Examples of these techniques are
found in, for example, Berger et al., Guide to Molecular Cloning Techniques, Methods in
Enzymology 152, Academic Press, Inc., San Diego, CA (Berger), which is incorporated
herein by reference in its entirety.

The nucleic acid molecules of the present invention, and fragments derived 
therefrom, are useful for screening for restriction fragment length polymorphism (RFLP) 
associated with certain disorders, as well as for genetic mapping.

Antisense oligonucleotides, or fragments of SEQ ID NO:1 or SEQ ID NO:2,
or sequences complementary thereto, derived from the nucleotide sequences of the present
invention encoding PAK5 are useful as diagnostic tools for probing gene expression in
various tissues. For example, tissue can be probed in situ with oligonucleotide probes
carrying detectable groups by conventional autoradiography techniques to investigate
native expression of this enzyme or pathological conditions relating thereto. Antisense
oligonucleotides are preferably directed to regulatory regions of SEQ ID NO:1 or SEQ ID
NO:2 or mRNA corresponding thereto, including, but not limited to, the initiation codon,
TATA box, enhancer sequences, and the like.

Automated sequencing methods were used to obtain or verify the nucleotide
sequence of pak5. The pak5 nucleotide sequences of the present invention were obtained
for both DNA strands, and are believed to be 100% accurate. However, as is known in the
art, nucleotide sequence obtained by automated methods may contain some errors.
Nucleotide sequences determined by automation are typically at least about 90%, more
typically at least about 95% to at least about 99.9% identical to the actual nucleotide
sequence of a given nucleic acid molecule. The actual sequence may be more precisely
determined using manual sequencing methods, which are well known in the art. An error
in sequence which results in an insertion or deletion of one or more nucleotides may result
in a frame shift in translation such that the predicted amino acid sequence will differ from
that which would be predicted from the actual nucleotide sequence of the nucleic acid
molecule, starting at the point of the mutation.
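
The effect of such a sequencing error can be sketched as follows. This is a minimal illustration, assuming the standard genetic code; the 18-nucleotide sequence is a hypothetical toy example and is not taken from SEQ ID NO:1 or SEQ ID NO:2, and the function names are likewise illustrative.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(dna):
    """Translate complete codons in reading frame 1; '*' marks a stop codon."""
    usable = len(dna) - len(dna) % 3
    return "".join(CODON_TABLE[dna[i:i + 3]] for i in range(0, usable, 3))


def first_difference(a, b):
    """Index of the first position at which two strings differ, or None."""
    for i, (x, y) in enumerate(zip(a, b)):
        if x != y:
            return i
    return None if len(a) == len(b) else min(len(a), len(b))


actual = "ATGGCTAAAGGTCTGTAA"            # hypothetical toy sequence
with_error = actual[:6] + "C" + actual[6:]  # one spurious inserted base

print(translate(actual))       # MAKGL*
print(translate(with_error))   # MAQRSV
print(first_difference(translate(actual), translate(with_error)))  # 2
```

The two predicted peptides agree up to the codon containing the inserted base and diverge from that point onward, which is the behavior the paragraph above describes.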

Another aspect of the present invention is directed to vectors, or
recombinant expression vectors, comprising any of the nucleic acid molecules described
above. Vectors are used herein either to amplify DNA or RNA encoding PAK5 and/or to
express DNA which encodes PAK5. Preferred vectors include, but are not limited to,
plasmids, phages, cosmids, episomes, viral particles or viruses, and integratable DNA
fragments (i.e., fragments integratable into the host genome by homologous
recombination). Preferred viral particles include, but are not limited to, adenoviruses,
parvoviruses, herpesviruses, poxviruses, adeno-associated viruses, Semliki Forest viruses,
vaccinia viruses, and retroviruses. Preferred expression vectors include, but are not limited
to, pcDNA3 (Invitrogen) and pSVL (Pharmacia Biotech). Other expression vectors
include, but are not limited to, pSPORT vectors, pGEM vectors (Promega),
pPROEX vectors (LTI, Bethesda, MD), Bluescript vectors (Stratagene), pQE vectors
(Qiagen), pSE420 (Invitrogen), and pYES2 (Invitrogen).

Preferred expression vectors are replicable DNA constructs in which a DNA
sequence encoding PAK5 is operably linked to suitable control sequences capable of
effecting the expression of PAK5 in a suitable host. DNA regions are operably linked
when they are functionally related to each other. For example, a promoter is operably
linked to a coding sequence if it controls the transcription of the sequence. Amplification
vectors do not require expression control domains, but rather need only the ability to
replicate in a host, usually conferred by an origin of replication, and a selection gene to
facilitate recognition of transformants. The need for control sequences in the expression
vector will vary depending upon the host selected and the transformation method chosen.
Generally, control sequences include a transcriptional promoter, an optional operator
sequence to control transcription, a sequence encoding suitable mRNA ribosomal binding
sites, and sequences which control the termination of transcription and translation.

Preferred vectors contain a promoter which is recognized by the
host organism. The promoter sequences of the present invention may be prokaryotic,
eukaryotic or viral. Examples of suitable prokaryotic sequences include the PR and PL
promoters of bacteriophage lambda (The bacteriophage Lambda, Hershey, A. D., Ed., Cold
Spring Harbor Press, Cold Spring Harbor, NY (1973), which is incorporated herein by
reference in its entirety; Lambda II, Hendrix, R. W., Ed., Cold Spring Harbor Press, Cold
Spring Harbor, NY (1980), which is incorporated herein by reference in its entirety); the
trp, recA, heat shock, and lacZ promoters of E. coli and the SV40 early promoter (Benoist
et al., Nature, 1981, 290, 304-310, which is incorporated herein by reference in its entirety).
Additional promoters include, but are not limited to, mouse mammary tumor virus, long
terminal repeat of human immunodeficiency virus, Moloney virus, cytomegalovirus
immediate early promoter, Epstein Barr virus, Rous sarcoma virus, human actin, human
myosin, human hemoglobin, human muscle creatine, and human metallothionein.

Additional regulatory sequences can also be included in preferred vectors.
Preferred examples of suitable regulatory sequences are represented by the Shine-Dalgarno
sequences of the replicase gene of the phage MS-2 and of the gene cII of bacteriophage
lambda. The Shine-Dalgarno sequence may be directly followed by the DNA encoding PAK5
and result in the expression of the mature PAK5 protein.

Moreover, suitable expression vectors can include an appropriate marker 
which allows the screening of the transformed host cells. The transformation of the 
selected host is carried out using any one of the various techniques well known to one of 
skill in the art and described in Sambrook et al., supra.

An origin of replication can be provided either by constructing the
vector to include an exogenous origin or by the host cell chromosomal
replication mechanism. If the vector is integrated into the host cell chromosome, the latter
may be sufficient. Alternatively, rather than using vectors which contain viral origins of 
replication, one skilled in the art can transform mammalian cells by the method of co- 
transformation with a selectable marker and pak5 DNA. An example of a suitable marker 
is dihydrofolate reductase (DHFR) or thymidine kinase (see, U.S. Patent No. 4,399,216). 

Nucleotide sequences encoding PAK5 may be recombined with vector DNA 
in accordance with conventional techniques, including blunt-ended or staggered-ended 
termini for ligation, restriction enzyme digestion to provide appropriate termini, filling in 
of cohesive ends as appropriate, alkaline phosphatase treatment to avoid undesirable
joining, and ligation with appropriate ligases. Techniques for such manipulation are
disclosed by Sambrook et al., supra and are well known in the art. Methods for
construction of mammalian expression vectors are disclosed in, for example, Okayama et
al., Mol. Cell. Biol., 1983, 3, 280, Cosman et al., Mol. Immunol., 1986, 25, 935, Cosman et
al., Nature, 1984, 312, 768, EP-A-0367566, and WO 91/18982, each of which is
incorporated herein by reference in its entirety. 

Another aspect of the present invention is directed to transformed host cells 
having an expression vector comprising any of the nucleic acid molecules described above. 
Expression of the nucleotide sequence occurs when the expression vector is introduced into 
an appropriate host cell. Suitable host cells for expression of the polypeptides of the 
invention include, but are not limited to, prokaryotes, yeast, and eukaryotes. If a 
prokaryotic expression vector is employed, then the appropriate host cell would be any 
prokaryotic cell capable of expressing the cloned sequences. Suitable prokaryotic cells 
include, but are not limited to, bacteria of the genera Escherichia, Bacillus, Salmonella, 
Pseudomonas, Streptomyces, and Staphylococcus. 

If a eukaryotic expression vector is employed, then the appropriate host
cell would be any eukaryotic cell capable of expressing the cloned sequence. Preferably,
eukaryotic cells are cells of higher eukaryotes. Suitable eukaryotic cells include, but are
not limited to, non-human mammalian tissue culture cells and human tissue culture cells.
Preferred host cells include, but are not limited to, insect cells, HeLa cells, Chinese hamster
ovary cells (CHO cells), African green monkey kidney cells (COS cells), human 293 cells,
and murine 3T3 fibroblasts. Propagation of such cells in cell culture has become a routine
procedure (see, Tissue Culture, Academic Press, Kruse and Patterson, eds. (1973), which is
incorporated herein by reference in its entirety).

In addition, a yeast host may be employed as a host cell. Preferred yeast
cells include, but are not limited to, the genera Saccharomyces, Pichia, and Kluyveromyces.
Preferred yeast hosts are S. cerevisiae and P. pastoris. Preferred yeast vectors can contain
an origin of replication sequence from a 2µ yeast plasmid, an autonomously replicating
sequence (ARS), a promoter region, sequences for polyadenylation, sequences for
transcription termination, and a selectable marker gene. Shuttle vectors for replication in
both yeast and E. coli are also included herein.

Alternatively, insect cells may be used as host cells. In a preferred
embodiment, the polypeptides of the invention are expressed using a baculovirus
expression system (see, Luckow et al., Bio/Technology, 1988, 6, 47, Baculovirus
Expression Vectors: A Laboratory Manual, O'Reilly et al. (Eds.), W.H. Freeman and
Company, New York, 1992, and U.S. Patent No. 4,879,236, each of which is incorporated
herein by reference in its entirety). In addition, the MAXBAC™ complete baculovirus
expression system (Invitrogen) can, for example, be used for production in insect cells.

Another aspect of the present invention is directed to compositions,
including pharmaceutical compositions, comprising any of the nucleic acid molecules or
recombinant expression vectors described above and an acceptable carrier or diluent.
Preferably, the carrier or diluent is pharmaceutically acceptable. Suitable carriers are
described in the most recent edition of Remington's Pharmaceutical Sciences, A. Osol, a
standard reference text in this field, which is incorporated herein by reference in its
entirety. Preferred examples of such carriers or diluents include, but are not limited to,
water, saline, Ringer's solution, dextrose solution, and 5% human serum albumin.
Liposomes and nonaqueous vehicles such as fixed oils may also be used. The formulations
are sterilized by commonly used techniques.

Another aspect of the present invention is directed to an isolated polypeptide
encoded by a nucleic acid molecule described above. In preferred embodiments of the
invention, the isolated polypeptide comprises the amino acid sequence set forth in SEQ ID
NO:3. Alternatively, the polypeptide is a fragment of the polypeptide of SEQ ID
NO:3. Preferably, the fragment comprises from about 5 to about 20 amino acids, from
about 21 to about 40 amino acids, from about 41 to about 60 amino acids, from about 61 to
about 80 amino acids, from about 81 to about 100 amino acids, from about 101 to about
120 amino acids, from about 121 to about 140 amino acids, from about 141 to about 160
amino acids, from about 161 to about 180 amino acids, from about 181 to about 200 amino
acids, from about 201 to about 220 amino acids, from about 221 to about 240 amino acids,
from about 241 to about 260 amino acids, from about 261 to about 280 amino acids, from
about 281 to about 300 amino acids, from about 301 to about 320 amino acids, from about
321 to about 340 amino acids, from about 341 to about 360 amino acids, from about 361 to
about 380 amino acids, from about 381 to about 400 amino acids, from about 401 to about
420 amino acids, from about 421 to about 440 amino acids, from about 441 to about 460
amino acids, from about 461 to about 480 amino acids, from about 481 to about 500 amino
acids, from about 501 to about 520 amino acids, from about 521 to about 540 amino acids,
from about 541 to about 560 amino acids, from about 561 to about 580 amino acids, from
about 581 to about 600 amino acids, from about 601 to about 620 amino acids, from about
621 to about 640 amino acids, from about 641 to about 660 amino acids, from about 661 to
about 680 amino acids, from about 681 to about 700 amino acids, from about 701 to about
719 amino acids, and any combinations thereof. The fragment can be located within any
portion of SEQ ID NO:3.
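
The fragment size ranges recited above follow a regular pattern that can be generated programmatically. The following is a minimal sketch, assuming that 719 (the upper bound recited above) is the full length of SEQ ID NO:3; the function names and the placeholder sequence are hypothetical.

```python
def fragment_length_ranges(total_length=719, first_min=5, step=20):
    """Return the (min, max) fragment-length ranges recited in the text:
    5-20, 21-40, ..., with the last range capped at total_length."""
    ranges = [(first_min, step)]
    start = step + 1
    while start <= total_length:
        ranges.append((start, min(start + step - 1, total_length)))
        start += step
    return ranges


def fragments_of_length(sequence, length):
    """All contiguous fragments of a given length, from any portion of the sequence."""
    return [sequence[i:i + length] for i in range(len(sequence) - length + 1)]


ranges = fragment_length_ranges()
print(ranges[0], ranges[1], ranges[-1], len(ranges))

# Placeholder sequence for illustration only; not SEQ ID NO:3.
print(fragments_of_length("MAKGLSTV", 5))
```

With the default arguments the first range is (5, 20), the last is (701, 719), and 36 ranges are produced in total.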

In another preferred embodiment of the invention, the polypeptide
comprises an amino acid sequence homologous to SEQ ID NO:3 or a fragment thereof as
described above. It is to be understood that the present invention includes proteins
homologous to, and having essentially the same biological properties as, the polypeptides
encoded by the nucleotide sequences described herein, i.e., a variant. This definition is
intended to encompass isoforms and natural allelic variants of the pak5 genes described
herein. These variant forms may result from, for example, alternative splicing or
differential expression in different tissue of the same source organism. The variant forms
may be characterized by, for example, amino acid insertion(s), deletion(s) or
substitution(s). In this connection, a variant form having an amino acid sequence which
has at least about 70% sequence homology, at least about 80% sequence homology,
preferably about 90% sequence homology, more preferably about 95% sequence homology
and most preferably about 98% sequence homology to SEQ ID NO:3 is contemplated as
being included in the present invention. A preferred homologous polypeptide comprises at
least one conservative amino acid substitution compared to SEQ ID NO:3. Amino acid
"insertions", "substitutions" or "deletions" are changes to or within an amino acid
sequence. The variation allowed in a particular amino acid sequence may be
experimentally determined by producing the peptide synthetically or by systematically
making insertions, deletions, or substitutions of nucleotides in the pak5 sequence using
recombinant DNA techniques.
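
One simple way to put a number on the percentages recited above is percent identity over an existing pairwise alignment. The sketch below is an assumption for illustration: the text does not specify how sequence homology is to be computed, the two aligned strings are hypothetical and unrelated to SEQ ID NO:3, and the function names are invented here.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of alignment columns with identical residues.

    Both inputs must already be aligned to equal length, with '-' for gaps.
    Columns that are gaps in both sequences are ignored.
    """
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have equal length")
    columns = [(x, y) for x, y in zip(aligned_a, aligned_b)
               if not (x == "-" and y == "-")]
    if not columns:
        return 0.0
    matches = sum(1 for x, y in columns if x == y)
    return 100.0 * matches / len(columns)


def highest_threshold_met(percent, thresholds=(70, 80, 90, 95, 98)):
    """Largest recited threshold that the value reaches, or None."""
    met = [t for t in thresholds if percent >= t]
    return max(met) if met else None


pct = percent_identity("MAKGLSTV-E", "MAKGISTVQE")  # hypothetical alignment
print(pct, highest_threshold_met(pct))
```

In this toy alignment eight of ten columns match, so the variant would fall in the "at least about 80%" tier under the assumed measure.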

Alterations of the naturally occurring amino acid sequence can be
accomplished by any of a number of known techniques. For example, mutations can be
introduced into the polynucleotide encoding a polypeptide at particular locations by
procedures well known to the skilled artisan, such as oligonucleotide-directed mutagenesis,
which is described by Walder et al., Gene, 1986, 42, 133, Bauer et al., Gene, 1985, 37, 73,
Craik, BioTechniques, January 1985, pp. 12-19, Smith et al., Genetic Engineering:
Principles and Methods, Plenum Press (1981), and U.S. Patent Numbers 4,518,584 and
4,737,462, each of which is incorporated herein by reference in its entirety.

Preferably, a PAK5 variant of the present invention will exhibit substantially
the biological activity of a naturally occurring PAK5 polypeptide. By "exhibit
substantially the biological activity of a naturally occurring PAK5 polypeptide" is meant
that PAK5 variants within the scope of the invention can comprise conservatively
substituted sequences, meaning that one or more amino acid residues of a PAK5
polypeptide are replaced by different residues that do not alter the secondary and/or tertiary
structure of the PAK5 polypeptide. Such substitutions may include the replacement of an
amino acid by a residue having similar physicochemical properties, such as substituting one
aliphatic residue (Ile, Val, Leu or Ala) for another, or substitution between basic residues
Lys and Arg, acidic residues Glu and Asp, amide residues Gln and Asn, hydroxyl residues
Ser and Tyr, or aromatic residues Phe and Tyr. Further information regarding making
phenotypically silent amino acid exchanges can be found in Bowie et al., Science, 1990,
247, 1306-1310, which is incorporated herein by reference in its entirety. Other PAK5
variants which might retain substantially the biological activities of PAK5 are those where
amino acid substitutions have been made in areas outside functional regions of the protein.
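
The residue groupings listed above can be encoded as a lookup to check whether a proposed substitution is conservative in the sense used here. This is a minimal sketch; the group names and the function name are hypothetical labels, and the groups contain only the residues named in the paragraph above.

```python
# Residue groups exactly as listed in the text (three-letter codes).
CONSERVATIVE_GROUPS = {
    "aliphatic": {"Ile", "Val", "Leu", "Ala"},
    "basic": {"Lys", "Arg"},
    "acidic": {"Glu", "Asp"},
    "amide": {"Gln", "Asn"},
    "hydroxyl": {"Ser", "Tyr"},
    "aromatic": {"Phe", "Tyr"},
}


def is_conservative(original, replacement):
    """True if both residues differ and share at least one listed group."""
    if original == replacement:
        return False  # identical residues are not a substitution
    return any(original in group and replacement in group
               for group in CONSERVATIVE_GROUPS.values())


print(is_conservative("Leu", "Val"))  # True: both aliphatic
print(is_conservative("Lys", "Glu"))  # False: basic versus acidic
print(is_conservative("Phe", "Tyr"))  # True: both aromatic
```

A residue may belong to more than one group (Tyr appears in two), which is why the check tests every group rather than mapping each residue to a single class.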

The polypeptides to be expressed in such host cells may also be fusion
proteins which include regions from heterologous proteins. Such regions may be included
to allow, e.g., secretion, improved stability, or facilitated purification of the polypeptide.
For example, a sequence encoding an appropriate signal peptide can be incorporated into
expression vectors. A DNA sequence for a signal peptide (secretory leader) may be fused
in-frame to the polynucleotide sequence so that the polypeptide is translated as a fusion
protein comprising the signal peptide. A signal peptide that is functional in the intended
host cell promotes extracellular secretion of the polypeptide. Preferably, the signal
sequence will be cleaved from the polypeptide upon secretion of the polypeptide from the
cell. Thus, preferred fusion proteins can be produced in which the N-terminus of PAK5 is
fused to a carrier peptide.

In one embodiment, the polypeptide comprises a fusion protein which
includes a heterologous region used to facilitate purification of the polypeptide. Many of
the available peptides used for such a function allow selective binding of the fusion protein
to a binding partner. A preferred fusion partner includes one or more of the IgG binding
domains of protein A; such fusion proteins are easily purified to homogeneity by affinity
chromatography on, for example, IgG-coupled Sepharose. Alternatively, many vectors have
the advantage of carrying a stretch of histidine residues that can be expressed at the
N-terminal or C-terminal end of the target protein. Thus the protein of interest can be
recovered by metal chelation chromatography. A nucleotide sequence encoding a recognition
site for a proteolytic enzyme such as enterokinase, factor Xa, procollagenase or thrombin may
immediately precede the sequence for PAK5 to permit cleavage of the fusion protein to
obtain the mature PAK5 protein. Additional examples of fusion partners include, but are
not limited to, the yeast α-factor, the honeybee melittin leader in Sf9 insect cells, 6-His tag,
thioredoxin tag, hemagglutinin tag, GST tag, and OmpA signal sequence tag. As will be
understood by one of skill in the art, the binding partner which recognizes and binds to the
peptide may be any molecule or compound including metal ions (e.g., metal affinity
columns), antibodies, or fragments thereof, and any protein or peptide which binds the
peptide, such as the FLAG tag.

The polypeptides of the invention can be used as antigens for raising
antibodies against the same and used to screen for compounds that modulate the activity of
PAK5. PAK5 can also be used in compositions. Accordingly, the invention relates to
PAK5 or an antibody according to the invention for use as a medicament as well as to the
use of the molecules in the manufacture of a medicament directed towards conditions
wherein PAK5 activity is defective or unregulated: these are expected to include, but are
not limited to, cancer, angiogenesis-related diseases, diseases of the central nervous system,
and diseases due to inappropriate activation of immune responses. The molecules used as
medicaments according to the invention may be the polypeptides or antibodies described
herein as well as any novel substance identified in a screening method described herein.

In another aspect, the invention provides PAK5 polypeptides with or
without associated native pattern glycosylation, acylation, sialylation, or other
post-translational modifications. PAK5 expressed in yeast or mammalian expression systems
(discussed below) may be similar to or significantly different from a native PAK5
polypeptide in molecular weight and glycosylation pattern. Of course, expression of PAK5
in bacterial expression systems will provide non-glycosylated PAK5.

Another aspect of the present invention is directed to compositions,
including pharmaceutical compositions, comprising any of the polypeptides described
above and an acceptable carrier or diluent. Preferably, the carrier or diluent is
pharmaceutically acceptable. Compositions comprising a polypeptide, as described above,
can be used to, for example, induce antibody formation and to induce an immune response
for use in, for example, vaccine preparations.

Another aspect of the present invention is directed to methods of producing
a polypeptide comprising SEQ ID NO:3, or a homolog or fragment thereof, comprising
introducing any of the recombinant expression vectors described above into compatible
host cells, growing the host cells under conditions for expression of the polypeptide, and
recovering the polypeptide from the host cells. Eukaryotic systems are preferred since they
provide a variety of processing mechanisms which result in, for example, glycosylation,
carboxy-terminal amidation, oxidation or derivatization of certain amino acid residues,
conformational control, and so forth.

The polypeptides of the present invention are preferably provided in an
isolated form, are preferably substantially purified, and most preferably are purified to
homogeneity. Host cells are preferably lysed and the polypeptide is recovered from the
lysate of the host cells. Alternatively, the polypeptide is recovered by purifying the cell
culture medium from the host cells, preferably without lysing the host cell. The
polypeptides can be recovered and purified from recombinant cell cultures by well-known
methods, including ammonium sulfate or ethanol precipitation, anion or cation exchange
chromatography, phosphocellulose chromatography, hydrophobic interaction
chromatography, affinity chromatography, hydroxylapatite chromatography and lectin
chromatography.

In addition to producing these proteins by recombinant techniques,
automated amino acid synthesizers may also be employed to produce PAK5 polypeptides,
or fragments or homologous proteins thereof.

Another aspect of the present invention is directed to an antibody or
antibodies which bind to an epitope on any of the polypeptides described herein.
Preferably, the antibody binds to an epitope within SEQ ID NO:3. The antibodies
according to the invention can be monoclonal or polyclonal and include individual, allelic,
strain or species variants, or fragments thereof, both in their naturally occurring
(full-length) forms and recombinant forms. Additionally, the antibodies are raised to the
present proteins in either their native configuration or in non-native configurations.
Anti-idiotypic antibodies can also be generated.

Hybridomas which produce antibodies that bind to the polypeptides of the
invention, and the antibodies themselves, are useful in the isolation and purification of the
polypeptides. In addition, antibodies may be specific inhibitors of PAK5 activity.
Antibodies which specifically bind to the polypeptides of the invention can be used to
purify the protein from natural sources using well known techniques and readily available
starting materials. Such antibodies can also be used to purify the protein from material
present when producing the protein by recombinant DNA methodology.

Many methods of making antibodies are known to persons skilled in the art.
For techniques for preparing monoclonal antibodies, see e.g. Stites et al. (eds.), Basic and
Clinical Immunology (4th ed.), Lange Medical Publications, Los Altos, CA, which is
incorporated herein by reference in its entirety, and references cited therein. Techniques
that involve selection of libraries of recombinant antibodies in phage or similar vectors are
described in Huse et al., Science, 1989, 246, 1275-1281, which is incorporated herein by
reference in its entirety. The production of antibodies and the protein structures of
complete, intact antibodies, Fab fragments and F(ab)2 fragments and the organization of the
genetic sequences that encode such molecules are well known and are also described, for
example, in Harlow, E. and D. Lane (1988) ANTIBODIES: A Laboratory Manual, Cold
Spring Harbor Laboratory, Cold Spring Harbor, NY, which is incorporated herein by
reference. Briefly, for example, a polypeptide of the invention is injected into mice. The
spleen of the mouse is removed, and the spleen cells are isolated and fused with immortalized
mouse cells. The hybrid cells, or hybridomas, are cultured and those cells which secrete
antibodies are selected. The antibodies are analyzed and, if found to specifically bind to
the polypeptide, the hybridoma which produces them is cultured to produce a continuous
supply of antibodies.

The present invention is also directed to kits, including pharmaceutical kits.
The kits can comprise any of the nucleic acid molecules described above, any of the
polypeptides described above, or any antibody which binds to a polypeptide of the
invention as described above, as well as a negative control. The kit preferably comprises
additional components, such as, for example, instructions, solid supports, reagents helpful
for quantification, and the like.

Another aspect of the present invention is directed to methods of inducing
an immune response in a mammal against a polypeptide of the invention by administering
to the mammal an amount of the polypeptide sufficient to induce an immune response. The
amount will be dependent on the animal species, size of the animal, and the like but can be
determined by those skilled in the art.

Another aspect of the present invention is directed to methods of identifying
compounds which bind to either PAK5 or nucleic acid molecules encoding PAK5,
comprising contacting PAK5, or a nucleic acid molecule encoding the same, with a
compound, and determining whether the compound binds PAK5, or a nucleic acid
molecule encoding the same. Binding can be determined by binding assays which are well
known to the skilled artisan, including, but not limited to, gel-shift assays, Western blots,
radiolabeled competition assay, phage-based expression cloning, co-fractionation by
chromatography, co-precipitation, cross linking, interaction trap/two-hybrid analysis,
southwestern analysis, ELISA, and the like, which are described in, for example, Current
Protocols in Molecular Biology, 1999, John Wiley & Sons, NY, which is incorporated
herein by reference in its entirety. The compounds to be screened (which may
include compounds which are suspected to bind PAK5, or a nucleic acid molecule
encoding the same) include, but are not limited to, compounds of extracellular,
intracellular, biologic or chemical origin. The PAK5 polypeptide or polynucleotide
employed in such a test may either be free in solution, attached to a solid support, borne
on a cell surface or located intracellularly. One skilled in the art can, for example,
measure the formation of complexes between PAK5 and the compound being tested.
Alternatively, one skilled in the art can examine the diminution in complex formation
between PAK5 and its substrate caused by the compound being tested.

Another aspect of the present invention is directed to methods of identifying
compounds which modulate (i.e., increase or decrease) an activity of PAK5 comprising
contacting PAK5 with a compound, and determining whether the compound modifies
activity of PAK5. The activity in the presence of the test compound is compared to the
activity in the absence of the test compound. Where the activity of the sample containing
the test compound is higher than the activity in the sample lacking the test compound, the
compound is considered to have increased the activity. Similarly, where the activity of the
sample containing the test compound is lower than the activity in the sample lacking the test
compound, the compound is considered to have inhibited the activity.
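
The comparison described above can be written as a simple decision rule. The following is a minimal sketch; the function name, the returned labels, and the optional tolerance parameter (a margin for measurement noise) are hypothetical additions, and the activity values are assumed to be expressed in the same arbitrary units for both samples.

```python
def classify_modulation(activity_with, activity_without, tolerance=0.0):
    """Compare PAK5 activity measured with and without a test compound.

    tolerance is an assumed margin below which the two measurements
    are treated as equal; it is not part of the described method.
    """
    if tolerance < 0:
        raise ValueError("tolerance must be non-negative")
    difference = activity_with - activity_without
    if difference > tolerance:
        return "increases activity"
    if difference < -tolerance:
        return "inhibits activity"
    return "no detectable change"


# Made-up readings in arbitrary units, for illustration only.
print(classify_modulation(150.0, 100.0))
print(classify_modulation(40.0, 100.0))
print(classify_modulation(101.0, 100.0, tolerance=5.0))
```

In practice the threshold for calling a difference meaningful would depend on the assay and its replicates; the tolerance argument only marks where such a criterion would enter.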

The present invention is particularly useful for screening compounds by
using PAK5 in any of a variety of drug screening techniques. The compounds to be
screened (which may include compounds which are suspected to modulate PAK5
activity) include, but are not limited to, compounds of extracellular, intracellular,
biologic or chemical origin. The PAK5 polypeptide employed in such a test may be in any
form, preferably free in solution, attached to a solid support, borne on a cell surface, or
located intracellularly. One skilled in the art can, for example, measure the formation of
complexes between PAK5 and the compound being tested. Alternatively, one skilled in the
art can examine the diminution in complex formation between PAK5 and its substrate
caused by the compound being tested.

PAK5 is herein predicted to be a serine/threonine protein kinase which,
amongst the kinase families known to date, has the highest degree of functional and
structural homology to the previously described STE20-like PAK kinases. Compelling
evidence for this is present in the primary structure of PAK5 protein (SEQ ID NO:3). For
example, SEQ ID NO:3 was used to search the GenBank sequence database using the
TBLASTN algorithm within the BLAST series of database search programs. GenBank is
the National Institutes of Health (NIH) genetic sequence database, an annotated collection
of all publicly available DNA sequences (Benson, D.A. et al., Nucleic Acids Research,
vol. 27, pp. 12-17, 1999). There are approximately 3,841,000,000 bases in 4,865,000
sequence records as of October 1999. GenBank is available for searching via several
methods, and may be accessed through the internet GenBank website that is maintained by
the National Center for Biotechnology Information of the National Library of Medicine of
the National Institutes of Health (internet address:
http://www.ncbi.nlm.nih.gov/Genbank/GenbankSearch.html). GenBank is part of the
International Nucleotide Sequence Database Collaboration, which is comprised of the DNA
DataBank of Japan (DDBJ), the European Molecular Biology Laboratory (EMBL), and
GenBank at NCBI. These three organizations exchange data on a daily basis. The BLAST
algorithm, which stands for Basic Local Alignment Search Tool, is suitable for determining
sequence similarity (Altschul et al., J. Mol. Biol., 1990, 215, 403-410, which is
incorporated herein by reference in its entirety). Software for performing BLAST analyses
is publicly available through the National Center for Biotechnology Information
(http://www.ncbi.nlm.nih.gov/). The BLAST algorithm involves first identifying high-
scoring sequence pairs (HSPs) by identifying short words of length W in the query sequence
that either match or satisfy some positive-valued threshold score T when aligned with a
word of the same length in a database sequence. T is referred to as the neighborhood word
score threshold (Altschul et al., supra). These initial neighborhood word hits act as seeds
for initiating searches to find HSPs containing them. The word hits are extended in both
directions along each sequence for as far as the cumulative alignment score can be
increased. Extensions for the word hits in each direction are halted when: 1) the
cumulative alignment score falls off by the quantity X from its maximum achieved value;
2) the cumulative score goes to zero or below, due to the accumulation of one or more
negative-scoring residue alignments; or 3) the end of either sequence is reached. The
BLAST algorithm parameters W, T and X determine the sensitivity and speed of the
alignment. The BLAST program uses as defaults a word length (W) of 11, the BLOSUM62
scoring matrix (see Henikoff et al., Proc. Natl. Acad. Sci. USA, 1992, 89, 10915-10919,
which is incorporated herein by reference in its entirety), alignments (B) of 50, expectation
(E) of 10, M=5, N=4, and a comparison of both strands.
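
The seed-and-extend procedure described above can be sketched in a few lines of Python. This is a simplified illustration, not the NCBI implementation: seeds are exact word matches only (the neighborhood threshold T is omitted), extension is ungapped and shown in one direction only, and the scores of +5 for a match and -4 for a mismatch are an assumed reading of the M and N parameters. The function names are hypothetical.

```python
def find_seeds(query, subject, word_len):
    """Return (query_pos, subject_pos) pairs where a word of length word_len matches exactly."""
    index = {}
    for j in range(len(subject) - word_len + 1):
        index.setdefault(subject[j:j + word_len], []).append(j)
    return [(i, j)
            for i in range(len(query) - word_len + 1)
            for j in index.get(query[i:i + word_len], [])]


def extend_right(query, subject, q_start, s_start, word_len, x_drop,
                 match=5, mismatch=-4):
    """Extend a seed to the right without gaps.

    Extension stops when (1) the score falls x_drop below its maximum,
    (2) the score reaches zero or below, or (3) either sequence ends.
    Returns (best_score, aligned_length_at_best_score).
    """
    score = sum(match if query[q_start + k] == subject[s_start + k] else mismatch
                for k in range(word_len))
    best_score, best_len = score, word_len
    k = word_len
    while q_start + k < len(query) and s_start + k < len(subject):
        score += match if query[q_start + k] == subject[s_start + k] else mismatch
        k += 1
        if score > best_score:
            best_score, best_len = score, k
        elif best_score - score >= x_drop or score <= 0:
            break
    return best_score, best_len


if __name__ == "__main__":
    # Toy sequences, for illustration only.
    query, subject = "ACGTACGTTTTT", "ACGTACGTAAAA"
    print(find_seeds("ACGT", "TTACGT", 4))
    print(extend_right(query, subject, 0, 0, word_len=4, x_drop=10))
```

A larger `x_drop` lets the extension tolerate longer runs of mismatches before giving up, which is the sensitivity/speed trade-off the parameters W, T and X control.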

The BLAST algorithm (Karlin et al., Proc. Natl. Acad. Sci. USA, 1993, 90,
5873-5877, which is incorporated herein by reference in its entirety) and Gapped BLAST
(Altschul et al., Nuc. Acids Res., 1997, 25, 3389, which is incorporated herein by reference
in its entirety) perform a statistical analysis of the similarity between two sequences. One
measure of similarity provided by the BLAST algorithm is the smallest sum probability
(P(N)), which provides an indication of the probability by which a match between two
nucleotide or amino acid sequences would occur by chance. For example, a nucleic acid is
considered similar to a pak5 gene or cDNA if the smallest sum probability in comparison
of the test nucleic acid to a PAK5 nucleic acid is less than about 1, preferably less than
about 0.1, more preferably less than about 0.01, and most preferably less than about 0.001.
TBLASTN is a variant of BLAST provided by the NCBI at the GenBank internet site (vide
supra) which may be used for comparing amino acid sequences to nucleotide sequences,
and which in this case compares the protein query sequence against the GenBank nucleotide
database dynamically translated in all reading frames.
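
Translation "in all reading frames" can be illustrated with a short Python sketch that translates a nucleotide sequence in the three forward and three reverse-complement frames using the standard genetic code. The function names are hypothetical, and this is only the translation step, not the TBLASTN search itself.

```python
from itertools import product

# Standard genetic code, codons enumerated with bases in the order T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def translate(dna):
    """Translate a DNA string codon by codon; unknown codons (e.g. with 'N') become 'X'."""
    dna = dna.upper()
    return "".join(CODON_TABLE.get(dna[i:i + 3], "X")
                   for i in range(0, len(dna) - 2, 3))


def six_frame_translations(dna):
    """Return translations for frames +1, +2, +3 and -1, -2, -3."""
    dna = dna.upper()
    reverse = dna.translate(COMPLEMENT)[::-1]
    frames = {}
    for offset in range(3):
        frames[f"+{offset + 1}"] = translate(dna[offset:])
        frames[f"-{offset + 1}"] = translate(reverse[offset:])
    return frames


if __name__ == "__main__":
    # First 30 nucleotides of the pak5 ORF (SEQ ID NO:2).
    orf_start = "atgtttgggaagaaaaagaaaaagattgaa"
    for frame, peptide in six_frame_translations(orf_start).items():
        print(frame, peptide)
```

Frame +1 of the first 30 nucleotides of SEQ ID NO:2 gives MFGKKKKKIE, the first ten residues of SEQ ID NO:3.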

Searching GenBank using TBLASTN with the amino acid sequence reported
in SEQ ID NO:3 as query sequence revealed that the first 110 amino acids of PAK5
polypeptide (SEQ ID NO:3) share approximately 66% identity and 83% similarity with the
corresponding region of human PAK4. This stretch of sequence includes a
GBD/CRIB domain motif, which in PAK5 corresponds to approximately residues 11-53
of SEQ ID NO:3. The GBD/CRIB motif is found in all PAKs, as well as in many other
proteins that bind Rac and Cdc42, and has been shown to be essential for interaction of
proteins with the GTPases (Burbelo et al., J. Biol. Chem., vol. 270, pp. 29071-29074, 1995).
Aside from this region of homology with PAK4, the remaining N-terminal region of
PAK5 (ca. residues 110 to 449 of SEQ ID NO:3) shares no significant homology with any
other known protein. The predicted kinase catalytic domain of the novel PAK5 (SEQ ID
NO:3 residues approximately 449 to 700) is, however, highly similar to the kinase domains
of the PAK family, and for example has approximately 85% identity and 92% similarity
with the kinase domain of human PAK4, approximately 54% identity and 76% similarity
with the kinase domain of human PAK3, approximately 54% identity and 75% similarity
with the kinase domain of human PAK1, and approximately 53% identity and 73%
similarity with the kinase domain of human PAK2. Like all PAK family members, the
kinase domain of PAK5 contains the 11 subdomains that are characteristic of
serine/threonine protein kinases (for an analysis of kinase subdomain sequences see, e.g.,
Hanks, S.K. and Quinn, A.M., Methods Enzymol., vol. 200, pp. 38-62; Hardie, G., Hanks, S.
et al., The Protein Kinase Factsbook, Academic Press Inc.; ISBN: 0123247195). Residue
positions 456-463 (GEGSTGIV) of PAK5, for instance, correspond to the consensus kinase
subdomain I GxGxxGxV. Subdomain II is involved in the phosphotransfer reaction and is
identified by an invariant lysine in the tripeptide sequence AxK. With regard to the novel
kinase SEQ ID NO:3, subdomain II is found in residues 476-478 (AVK). Subdomains VI
through IX, characterized by a large number of highly conserved residues, form the central
core of catalytic activity. Region VIB contains the consensus sequence HRDLxxxN; SEQ ID
NO:3 in this region is HRDIKSDS (wherein the substitutions of Ile for Leu and of Ser for
Asn are conservative). SEQ ID NO:3 also comprises the invariant or nearly invariant
residues Asp586, Phe587 and Gly588 in subdomain VII, all of which have
been implicated in ATP binding. The conserved D in subdomain VII (Asp586) functions to
orient the γ-phosphate of the ATP for transfer. Subdomain VIII of SEQ ID NO:3 contains
the highly conserved APE sequence (residues 610-613), with the Glu corresponding to the
invariant Glu613. The sequence DxWS/AxG of subdomain IX is represented by amino acid
positions 625-630 of SEQ ID NO:3 (DIWSLG). This region forms a large α-helix and the
initial Asp of the consensus sequence serves to stabilize the catalytic loop by hydrogen
bonding.
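
The consensus motifs named above can be located in a protein sequence with regular expressions, as in the following Python sketch. The dictionary and function names are hypothetical. The subdomain VIB pattern is deliberately relaxed to accept Ile in place of Leu and Ser in place of Asn, mirroring the conservative substitutions noted for SEQ ID NO:3; that relaxation is an assumption of this sketch rather than part of the published consensus.

```python
import re

# Consensus patterns from the text; 'x' (any residue) is written as '.'.
SUBDOMAIN_PATTERNS = {
    "I": r"G.G..G.V",            # GxGxxGxV
    "II": r"A.K",                # AxK
    "VIB": r"HRD[LI]...[NS]",    # HRDLxxxN, relaxed to admit Ile and Ser (assumption)
    "IX": r"D.W[SA].G",          # DxWS/AxG
}


def find_subdomain_motifs(protein, patterns=SUBDOMAIN_PATTERNS):
    """Return {subdomain: [(start, matched_text), ...]} using 1-based start positions."""
    hits = {}
    for name, pattern in patterns.items():
        hits[name] = [(m.start() + 1, m.group())
                      for m in re.finditer(pattern, protein)]
    return hits


if __name__ == "__main__":
    # Motif fragments quoted in the text, checked one at a time.
    for fragment in ("GEGSTGIV", "AVK", "HRDIKSDS", "DIWSLG"):
        print(fragment, find_subdomain_motifs(fragment))
```

Short patterns such as AxK match at many positions in a full-length protein, so hits are only meaningful when read in the context of the surrounding subdomain order.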

The activity of PAK5 polypeptide of the invention can therefore be
determined by, for example, assaying for kinase activity of PAK5. In such assays PAK5
polypeptide or a fragment thereof produced by recombinant means as described above is
contacted with a substrate in the presence of a suitable phosphate donor, preferably ATP,
containing radiolabeled phosphate, and PAK5-dependent incorporation of radiolabel into
the substrate is measured. By 'substrate', one means any substance containing a suitable
hydroxyl moiety that acts as an acceptor for the γ-phosphate group transferred from a donor
molecule such as ATP in a reaction catalyzed by PAK5. The substrate may be an
endogenous substrate of PAK5, i.e. a naturally-occurring substance that is phosphorylated
in unmodified cells by naturally-occurring PAK5, or any other substance that is not
normally phosphorylated by PAK5 in a physiological situation, but that may be
phosphorylated by PAK5 in the reaction conditions employed. Preferably, the substrate is
a protein or peptide, and preferably, the phosphorylation reaction occurs on a substrate
serine or threonine residue. It is well known to those skilled in the art that non-natural
substrates can act as suitable substrates in kinase assays such as that described above, and
examples of specific substrates which are commonly employed in such assays include, but
are not limited to, histone proteins and myelin basic protein. It is also well known to those
skilled in the art that detection of kinase-dependent substrate phosphorylation can be
effected by a number of means other than measurement of radiolabeled phosphate
incorporation into the substrate. For example, incorporation of phosphate groups can affect
physicochemical properties of the substrate, such as electrophoretic mobility, light
absorbance, fluorescence and/or phosphorescence, chromatographic properties, and so on.
Such alterations of substrate physicochemical properties can be readily measured by one
skilled in the art, and used as an indicator of kinase activity. Alternatively, it is also well
known that monoclonal or polyclonal antibodies can be generated which selectively
recognize phosphorylated forms of the substrate, and thus the degree of binding of such
antibodies to substrate subsequent to the kinase reaction can be used as an indirect method
of determining kinase activity. Furthermore, it is known that many kinases, including PAK
kinases, possess the capacity to phosphorylate residues on the same kinase molecule. Such
phosphorylation reactions are termed autophosphorylation, and therefore measurement of
incorporation of phosphate into PAK5 itself catalyzed by the same may also be used to
monitor PAK5 activity. Kinase assays such as those described above can be performed not
only using purified, or partially purified, recombinant PAK5, but also PAK5 which is
purified from cells which naturally express the protein using purification procedures such
as those described above.

In addition, PAK5 activity may be measured not solely by assaying the
kinase activity, but also by the detection of events that lead to, or are consequent to, PAK5
activity in intact cells, or cell lysates, or in systems in which signaling events are
reconstituted in vitro. For example, it is known that PAK proteins bind to, and are
activated by, GTP-binding proteins such as Rac and Cdc42. Thus detection of interaction of
PAK5 with naturally occurring activators such as Rac/Cdc42, and/or substrates, may also be
used as an indicator of PAK5 activity.

In order to identify compounds capable of modulating PAK5 activity, assays
such as, but not limited to, those described above can be performed in the presence and
absence of test compounds.

Other assays can be used to examine enzymatic activity including, but not
limited to, photometric, radiometric, HPLC, electrochemical, and the like, which are
described in, for example, Enzyme Assays: A Practical Approach, eds. R. Eisenthal and M.
J. Danson, 1992, Oxford University Press, which is incorporated herein by reference in its
entirety.

In preferred embodiments of the invention, methods of screening for
compounds which modulate PAK5 activity comprise contacting the compound with PAK5
and assaying for the presence of a complex between the compound and PAK5. In such
assays, PAK5 is typically labeled. After suitable incubation, free PAK5 is separated from
that present in bound form, and the amount of free or uncomplexed label is a measure of
the ability of the particular compound to bind to PAK5.

In another embodiment of the invention, high throughput screening for
compounds having suitable binding affinity to PAK5 is employed. Briefly, large numbers
of different small peptide test compounds are synthesized on a solid substrate. The peptide
test compounds are contacted with PAK5 and washed. Bound PAK5 is then detected by
methods well known in the art.

Purified polypeptides of the invention can also be coated directly onto plates
for use in the aforementioned drug screening techniques. In addition, non-neutralizing
antibodies can be used to capture the protein and immobilize it on the solid support.

Other embodiments of the invention comprise using competitive screening
assays in which neutralizing antibodies capable of binding a polypeptide of the invention
specifically compete with a test compound for binding to the polypeptide. In this manner,
the antibodies can be used to detect the presence of any peptide which shares one or more
antigenic determinants with PAK5. Radiolabeled competitive binding studies are described
in A.H. Lin et al., Antimicrobial Agents and Chemotherapy, 1997, vol. 41, no. 10, pp. 2127-
2131, the disclosure of which is incorporated herein by reference in its entirety.

In other embodiments of the invention, the polypeptides of the invention are
employed as a research tool for identification, characterization and purification of
interacting, regulatory proteins. Appropriate labels are incorporated into the polypeptides
of the invention by various methods known in the art and the polypeptides are used to
capture interacting molecules. For example, molecules are incubated with the labeled
polypeptides, washed to remove unbound polypeptides, and the polypeptide complex is
quantified. Data obtained using different concentrations of polypeptide are used to
calculate values for the number, affinity, and association of polypeptide with the protein
complex.

Labeled polypeptides are also useful as reagents for the purification of
molecules with which the polypeptide interacts including, but not limited to, inhibitors. In
one embodiment of affinity purification, a polypeptide is covalently coupled to a
chromatography column. Cells and their membranes are extracted, and various cellular
subcomponents are passed over the column. Molecules bind to the column by virtue of
their affinity to the polypeptide. The polypeptide-complex is recovered from the column,
dissociated and the recovered molecule is subjected to protein sequencing. This amino acid
sequence is then used to identify the captured molecule or to design degenerate
oligonucleotides for cloning the corresponding gene from an appropriate cDNA library.

Alternatively, compounds may be identified which exhibit similar properties
to PAK5 of the invention, but which are smaller and exhibit a longer half-life than PAK5 in
a human or animal body. When an organic compound is designed, a molecule according to
the invention is used as a "lead" compound. The design of mimetics to known
pharmaceutically active compounds is a well-known approach in the development of
pharmaceuticals based on such "lead" compounds. Mimetic design, synthesis and testing
are generally used to avoid randomly screening a large number of molecules for a target
property. Furthermore, structural data deriving from the analysis of the deduced amino
acid sequences encoded by the DNAs of the present invention are useful to design new
drugs that are more specific and therefore have a higher pharmacological potency.

As discussed above, comparison of the protein sequence of the present
invention with the sequences present in all the available databases showed a significant
homology with the GBD/CRIB domain and serine/threonine protein kinase domains.
Accordingly, computer modeling can be used to develop a putative tertiary structure of the
proteins of the invention based on the available information of other GBD/CRIB or kinase
domain proteins. Thus, novel enzyme inhibitors based on the predicted structure of PAK5
can be designed.

In a particular embodiment, the novel molecules identified by the screening
methods according to the invention are low molecular weight organic molecules, in which
case a composition or pharmaceutical composition can be prepared thereof for oral intake,
such as in tablets. The compositions, or pharmaceutical compositions, comprising the
nucleic acid molecules, vectors, polypeptides, antibodies and compounds identified by the
screening methods described herein, can be prepared for any route of administration
including, but not limited to, oral, intravenous, cutaneous, subcutaneous, nasal,
intramuscular or intraperitoneal. The nature of the carrier or other ingredients will depend
on the specific route of administration and particular embodiment of the invention to be
administered. Examples of techniques and protocols that are useful in this context are,
inter alia, found in Remington's Pharmaceutical Sciences, 16th edition, Osol, A. (ed.), 1980,
which is incorporated herein by reference in its entirety.

The dosage of these low molecular weight compounds will depend on the
disease state or condition to be treated and other clinical factors such as weight and
condition of the human or animal and the route of administration of the compound. For
treating humans or animals, between approximately 0.5 mg/kg of body weight and 500 mg/kg
of body weight of the compound can be administered. Therapy is typically administered at
lower dosages and is continued until the desired therapeutic outcome is observed.

The present compounds and methods, including nucleic acid molecules,
polypeptides, antibodies, and compounds identified by the screening methods described
herein, have a variety of pharmaceutical applications and may be used, for example, to treat
or prevent unregulated cellular growth, such as cancer cell and tumor growth. In a particular
embodiment, the present molecules are used in gene therapy. For a review of gene therapy
procedures, see e.g. Anderson, Science, 1992, 256, 808-813, which is incorporated herein
by reference in its entirety.

The sequences of the invention are found below in Table 2. 

TABLE 2 



Sequence ID NO:1 pak5

1 gagaccggga acatggcgct gggagcnctg tagcagctga gaaggggctg aggcaccgcc
61 gcttcgctga cagccggcca ccagatgttc atgcattcta gagaaagtgg aaaacttaga
121 agcctaatta atgactgtct tctggacctc tgagaccatg tttctagtgt tttccgtgga
181 atattatcag aaatacactg tggtgaaatg cttccacctc ttgctaaaat gaacactgag
241 gaaaaatgaa gaagactgac aagcaccagc gaaaagttgc agaatagaaa cagccacact
301 cctctggagt ctttaattca tccacagcca tcatataaag gttttggcat catgtttggg
361 aagaaaaaga aaaagattga aatatctggc ccgtccaact ttgaacacag ggttcatact
421 gggtttgatc cacaagagca gaagtttacc ggccttcccc agcagtggca cagcctgtta
481 gcagatacgg ccaacaggcc aaagcctatg gtggaccctt catgcatcac acccatccag
541 ctggctccta tgaagacaat cgttagagga aacaaaccct gcaaggaaac ctccatcaac
601 ggcctgctag aggattttga caacatctcg gtgactcgct ccaactccct aaggaaagaa
661 agcccaccca ccccagatca gggagcctcc agccacggtc caggccacgc ggaagaaaat
721 ggcttcatca ccttctccca gtattccagc gaatccgata ctactgctga ctacacgacc
781 gaaaagtaca gggagaagag tctctatgga gatgatctgg atccgtatta tagaggcagc
841 cacgcagcca agcaaaatgg gcacgtaatg aaaatgaagc acggggaggc ctactattct
901 gaggtgaagc ctttgaaatc cgattttgcc agattttctg ccgattatca ctcacatttg
961 gactcactga gcaaaccaag tgaatacagt gacctcaagt gggagtatca gagagcctcg
1021 agtagctccc ctctggatta ttcattccaa ttcacacctt ctagaactgc agggaccagc
1081 gggtgctcca aggagagcct ggcgtacagt gaaagtgaat ggggacccag cctggatgac
1141 tatgacagga ggccaaagtc ttcgtacctg aatcagacaa gccctcagcc caccatgcgg
1201 cagaggtcca ggtcaggctc gggactccag gaaccgatga tgccatttgg agcaagtgca
1261 tttaaaaccc atccccaagg acactcctac aactcctaca cctaccctcg cttgtccgag
1321 cccacaatgt gcattccaaa ggtggattac gatcgagcac agatggtcct cagccctcca
1381 ctgtcagggt ctgacaccta ccccaggggc cctgccaaac tacctcaaag tcaaagcaaa
1441 tcgggctatt cctcaagcag tcaccagtac ccgtctgggt accacaaagc caccttgtac
1501 catcacccct ccctgcagag cagttcgcag tacatctcca cggcttccta cctgagctcc
1561 ctcagcctct catccagcac ctacccgccg cccagctggg gctccccctc cgaccagcag
1621 ccctccaggg tgtcccatga acagtttcgg gcggccctgc agctggtggt cagcccagga
1681 gaccccaggg aatacttggc caactttatc aaaatcgggg aaggctcaac cggcatcgta
1741 tgcatcggca ccgagaaaca cacagggaaa caagttgcag tgaagaaaat ggacctccgg
1801 aagcaacaga gacgagaact gcttttcaat gaggtcgtga tcatgcggga ttaccaccat
1861 gacaatgtgg ttgacatgta cagcagctac cttgtcggcg atgagctctg ggtggtcatg
1921 gagtttctag aaggtggtgc cttgacagac attgtgactc acaccagaat gaatgaagaa
1981 cagatagcta ctgtctgcct gtcagttctg agagctctct cctaccttca taaccaagga
2041 gtgattcaca gggacataaa aagtgactcc atcctcctga caagcgatgg ccggataaag
2101 ttgtctgatt ttggtttctg tgctcaagtt tccaaagagg tgccgaagag gaaatcattg
2161 gttggcactc cctactggat ggcccctgag ctgatttcta ggctacctta tgggacagag
2221 gtggacatct ggtccctcgg gatcatggtg atagaaatga ttgatggcga gcccccctac
2281 ttcaatgagc ctcccctcca ggcgatgcgg aggatccggg acagtttacc tccaagagtg
2341 aaggacctac acaaggtttc ttcagtgctc cggggattcc tagacttgat gttggtgagg
2401 gagccctctc agagagcaac agcccaggaa ctcctcggac atccattctt aaaactagca
2461 ggtccaccgt cttgcatcgt ccccctcatg agacaataca ggcatcactg a



Sequence ID NO:2 pak5 ORF 

1 atgtttggga agaaaaagaa aaagattgaa atatctggcc cgtccaactt tgaacacagg
61 gttcatactg ggtttgatcc acaagagcag aagtttaccg gccttcccca gcagtggcac
121 agcctgttag cagatacggc caacaggcca aagcctatgg tggacccttc atgcatcaca
181 cccatccagc tggctcctat gaagacaatc gttagaggaa acaaaccctg caaggaaacc
241 tccatcaacg gcctgctaga ggattttgac aacatctcgg tgactcgctc caactcccta
301 aggaaagaaa gcccacccac cccagatcag ggagcctcca gccacggtcc aggccacgcg
361 gaagaaaatg gcttcatcac cttctcccag tattccagcg aatccgatac tactgctgac
421 tacacgaccg aaaagtacag ggagaagagt ctctatggag atgatctgga tccgtattat
481 agaggcagcc acgcagccaa gcaaaatggg cacgtaatga aaatgaagca cggggaggcc
541 tactattctg aggtgaagcc cttgaaatcc gattttgcca gattttctgc cgattatcac
601 tcacatttgg actcactgag caaaccaagt gaatacagtg acctcaagtg ggagtatcag
661 agagcctcga gtagctcccc tctggattat tcattccaat tcacaccttc tagaactgca
721 gggaccagcg ggtgctccaa ggagagcctg gcgtacagtg aaagtgaatg gggacccagc
781 ctggatgact atgacaggag gccaaagtct tcgtacctga atcagacaag ccctcagccc
841 accatgcggc agaggtccag gtcaggctcg ggactccagg aaccgatgat gccatttgga
901 gcaagtgcat ttaaaaccca tccccaagga cactcctaca actcctacac ctaccctcgc
961 ttgtccgagc ccacaatgtg cattccaaag gtggattacg atcgagcaca gatggtcctc
1021 agccctccac tgtcagggtc tgacacctac cccaggggcc ctgccaaact acctcaaagt
1081 caaagcaaat cgggctattc ctcaagcagt caccagtacc cgtctgggta ccacaaagcc
1141 accttgtacc atcacccctc cctgcagagc agttcgcagt acatctccac ggcttcctac
1201 ctgagctccc tcagcctctc atccagcacc tacccgccgc ccagctgggg ctcctcctcc
1261 gaccagcagc cctccagggt gtcccatgaa cagtttcggg cggccctgca gctggtggtc
1321 agcccaggag accccaggga atacttggcc aactttatca aaatcgggga aggctcaacc
1381 ggcatcgtat gcatcggcac cgagaaacac acagggaaac aagttgcagt gaagaaaatg
1441 gacctccgga agcaacagag acgagaactg cttttcaatg aggtcgtgat catgcgggat
1501 taccaccatg acaatgtggt tgacatgtac agcagctacc ttgtcggcga tgagctctgg
1561 gtggtcatgg agtttctaga aggtggtgcc ttgacagaca ttgtgactca caccagaatg
1621 aatgaagaac agatagctac tgtctgcctg tcagttctga gagctctctc ctaccttcat
1681 aaccaaggag tgattcacag ggacataaaa agtgactcca tcctcctgac aagcgatggc
1741 cggataaagt tgtctgattt tggtttctgt gctcaagttt ccaaagaggt gccgaagagg
1801 aaatcattgg ttggcactcc ctactggatg gcccctgagc tgatttctag gctaccttat
1861 gggacagagg tggacatctg gtccctcggg atcatggtga tagaaatgat tgatggcgag
1921 cccccctact tcaatgagcc tcccctccag gcgatgcgga ggatccggga cagtttacct
1981 ccaagagtga aggacctaca caaggtttct tcagtgctcc ggggattcct agacttgatg
2041 ttggtgaggg agccctctca gagagcaaca gcccaggaac tcctcggaca tccattctta
2101 aaactagcag gtccaccgtc ttgcatcgtc cccctcatga gacaatacag gcatcac



Sequence ID NO:3 PAK5 amino acid sequence 

1 MFGKKKKKIE ISGPSNFEHR VHTGFDPQEQ KFTGLPQQWH SLLADTANRP KPMVDPSCIT
61 PIQLAPMKTI VRGNKPCKET SINGLLEDFD NISVTRSNSL RKESPPTPDQ GASSHGPGHA
121 EENGFITFSQ YSSESDTTAD YTTEKYREKS LYGDDLDPYY RGSHAAKQNG HVMKMKHGEA
181 YYSEVKPLKS DFARFSADYH SHLDSLSKPS EYSDLKWEYQ RASSSSPLDY SFQFTPSRTA
241 GTSGCSKESL AYSESEWGPS LDDYDRRPKS SYLNQTSPQP TMRQRSRSGS GLQEPMMPFG
301 ASAFKTHPQG HSYNSYTYPR LSEPTMCIPK VDYDRAQMVL SPPLSGSDTY PRGPAKLPQS
361 QSKSGYSSSS HQYPSGYHKA TLYHHPSLQS SSQYISTASY LSSLSLSSST YPPPSWGSSS
421 DQQPSRVSHE QFRAALQLVV SPGDPREYLA NFIKIGEGST GIVCIGTEKH TGKQVAVKKM
481 DLRKQQRREL LFNEVVIMRD YHHDNVVDMY SSYLVGDELW VVMEFLEGGA LTDIVTHTRM
541 NEEQIATVCL SVLRALSYLH NQGVIHRDIK SDSILLTSDG RIKLSDFGFC AQVSKEVPKR
601 KSLVGTPYWM APELISRLPY GTEVDIWSLG IMVIEMIDGE PPYFNEPPLQ AMRRIRDSLP
661 PRVKDLHKVS SVLRGFLDLM LVREPSQRAT AQELLGHPFL KLAGPPSCIV PLMRQYRHH


SEQ ID NO:4 Human PAK1 CDS from GenBank 



1 atgtcaaata acggcctaga cattcaagac aaacccccag cccctccgat gagaaatacc
61 agcactatga ttggagccgg cagcaaagat gctggaaccc taaaccatgg ttctaaacct
121 ctgcctccaa acccagagga gaagaaaaag aaggaccgat tttaccgatc cattttacct
181 ggagataaaa caaataaaaa gaaagagaaa gagcggccag agatttctct cccttcagat
241 tttgaacaca caattcatgt cggttttgat gctgtcacag gggagtttac cggaatgcca
301 gagcagtggg cccgcttgct tcagacatca aatatcacta agtcggagca gaagaaaaac
361 ccgcaggctg ttctggatgt gttggagttt tacaactcga agaagacatc caacagccag
421 aaatacatga gctttacaga taagtcagct gaggattaca attcttctaa tgccttgaat
481 gtgaaggctg tgtctgagac tcctgcagtg ccaccagttt cagaagatga ggatgatgat
541 gatgatgatg ctaccccacc accagtgatt gctccacgcc cagagcacac aaaatctgta
601 tacacacggt ctgtgattga accacttcct gtcactccaa ctcgggacgt ggctacatct
661 cccatttcac ctactgaaaa taacaccact ccaccagatg ctttgaccct taatactgag
721 aagcagaaga agaagcctaa aatgtctgat gaggagatct tggagaaatt acgaagcata
781 gtgagtgtgg gcgatcctaa gaagaaatat acacggtttg agaagattgg acaaggtgct
841 tcaggcaccg tgtacacagc aatggatgtg gccacaggac aggaggtggc cattaagcag
901 atgaatcttc agcagcagcc caagaaagag ctgattatta atgagatcct ggtcatgagg
961 gaaaacaaga acccaaacat tgtgaattac ttggacagtt acctcgtggg agatgagctg
1021 tgggttgtta tggaatactt ggctggaggc tccttgacag atgtggtgac agaaacttgc
1081 atggatgaag gccaaattgc agctgtgtgc cgtgagtgtc tgcaggctct ggagtctttg
1141 cattcgaacc aggtcattca cagagacatc aagagtgaca atattctgtt gggaatggat
1201 ggctctgtca agctaactga ctttggattc tgtgcacaga taaccccaga gcagagcaaa
1261 cggagcacca tggtaggaac cccatactgg atggcaccag aggttgtgac acgaaaggcc
1321 tatgggccca aggttgacat ctggtccctg ggcatcatgg ccatcgaaat gattgaaggg
1381 gagcctccat acctcaatga aaaccctctg agagccttgt acctcattgc caccaatggg
1441 accccagaac ttcagaaccc agagaagctg tcagctatct tccgggactt tctgaaccgc
1501 tgtctcgaga tggatgtgga gaagagaggt tcagctaaag agctgctaca gcatcaattc
1561 ctgaagattg ccaagcccct ctccagcctc actccactga ttgctgcagc taaggaggca
1621 acaaagaaca atcactaa




SEQ ID NO:5 Human PAK2 CDS from GenBank 






1 atgtctgata acggagaact ggaagataag cctccagcac ctcctgtgcg aatgagcagc
61 accatcttta gcactggagg caaagaccct ttgtcagcca atcacagttt gaaacctttg
121 ccctctgttc cagaagagaa aaagcccagg cataaaatca tctccatatt ctcaggcaca
181 gagaaaggaa gtaaaaagaa agaaaaggaa cggccagaaa tttctcctcc atctgatttt
241 gagcacacca tccatgttgg ctttgatgct gttactggag aattcactgg catgccagaa
301 cagtgggctc gattactaca gacctccaat atcaccaaac tagagcaaaa gaagaatcct
361 caggctgtgc tggatgtcct aaagttctac gactccaaca cagtgaagca gaaatatctg
421 agctttactc ctcctgagaa agatggcctt ccttctggaa cgccagcact gaatgccaag
481 ggaacagaag cacccgcagt agtgacagag gaggaggatg atgatgaaga gactgctcct
541 cccgttattg ccccgcgacc ggatcatacg aaatcaattt acacacggtc tgtaattgac
601 cctgttcctg caccagttgg tgattcacat gttgatggtg ctgccaagtc tttagacaaa
661 cagaaaaaga agcctaagat gacagatgaa gagattatgg agaaattaag aactatcgtg
721 agcataggtg accctaagaa aaaatataca agatatgaaa aaattggaca aggggcttct
781 ggtacagttt tcactgctac tgacgttgca ctgggacagg aggttgctat caaacaaatt
841 aatttacaga aacagccaaa gaaggaactg atcattaacg agattctggt gatgaaagaa
901 ttgaaaaatc ccaacatcgt taactttttg gacagttacc tggtaggaga tgaattgttt
961 gtggtcatgg aataccttgc tggggggtca ctcactgatg tggtaacaga aacagcttgc
1021 atggatgaag cacagattgc tgctgtatgc agagagtgtt tacaggcatt ggagttttta
1081 catgctaatc aagtgatcca cagagacatc aaaagtgaca atgtactttt gggaatggaa
1141 ggatctgtta agctcactga ctttggtttc tgtgcccaga tcacccctga gcagagcaaa
1201 cgcagtacca tggtcggaac gccatactgg atggcaccag aggtggttac acggaaagct
1261 tatggcccta aagtcgacat atggtctctg ggtatcatgg ctattgagat ggtagaagga
1321 gagcctccat acctcaatga aaatcccttg agggccttgt acctaatagc aactaatgga
1381 accccagaac ttcagaatcc agagaaactt tccccaatat ttcgggattt cttaaatcga
1441 tgtttggaaa tggatgtgga aaaaaggggt tcagccaaag aattattaca gcatcctttc
1501 ctgaaactgg ccaaaccgtt atctagcttg acaccactga tcatggcagc taaagaagca
1561 atgaagagta accgttaa











SEQ ID NO:6 Human PAK3 CDS from GenBank

1 atgtctgacg gtctggataa tgaagagaaa cccccggctc ctccactgag gatgaatagt
61 aacaaccggg attcttcagc actcaaccac agctccaaac cacttcccat ggcccctgaa
121 gagaagaata agaaagccag gcttcgctct atcttcccag gaggagggga taaaaccaat
181 aagaagaagg agaaagagcg cccagagatc tctcttcctt cagactttga gcatacgatt
241 catgtggggt ttgatgcagt caccggggaa ttcactggaa ttccagagca atgggcacga
301 ttactccaaa cttccaacat aacaaaattg gaacagaaga agaacccaca agctgttcta
361 gatgttctca aattctatga ttccaaagaa acagtcaaca accagaaata catgagcttt
421 acatcaggag ataaaagtgc acatggatac atagcagccc atccttcgag tacaaaaaca
481 gcatctgagc ctccattggc ccctcctgtg tctgaagaag aagatgaaga ggaagaagaa
541 gaagaagatg aaaatgagcc accaccagtt atcgcaccaa gaccagagca tacaaaatca
601 atctatactc gttctgtggt tgaatccatt gcttcaccag cagtaccaaa taaagaggtc
661 acaccaccct ctgctgaaaa tgccaattcc agtactttgt acaggaacac agatcggcaa
721 agaaaaaaat ccaagatgac agatgaggag atcttagaga agctaagaag cattgtgagt
781 gttggggacc caaagaaaaa atacacaaga tttgaaaaaa ttggtcaagg ggcatcaggt
841 actgtttata cagcactaga cattgcaaca ggacaagagg tggccataaa gcagatgaac
901 cttcaacagc aacccaagaa ggaattaatt attaatgaaa ttctggtcat gagggaaaat
961 aagaacccta atattgttaa ttatttagat agctacttgg tgggtgatga actatgggta
1021 gtcatggaat acttggctgg tggctctctg actgatgtgg tcacagagac ctgtatggat
1081 gaaggacaga tagcagctgt ctgcagagag tgcctgcaag ctttggattt cctgcactca
1141 aaccaggtga tccatagaga tataaagagt gacaatattc ttctcgggat ggatggctct
1201 gttaaattga ctgactttgg gttctgtgcc cagatcactc ctgagcaaag taaacgaagc
1261 actatggtgg gaaccccata ttggatggca cctgaggtgg tgactcgaaa agcttatggt
1321 ccgaaagttg atatctggtc tcttggaatt atggcaattg aaatggtgga aggtgaaccc
1381 ccttacctta atgaaaatcc actcagggca ttgtatctga tagccactaa tggaactcca
1441 gagctccaga atcctgagag actgtcagct gtattccgtg actttttaaa tcgctgtctt
1501 gagatggatg tggataggcg aggatctgcc aaggagcttt tgcagcatcc atttttaaaa
1561 ttagccaagc ctctctccag cctgactcct ctgattatcg ctgcaaagga agcaattaag
1621 aacagcagcc gctaa











SEQ ID NO:7 Human PAK4 CDS from GenBank

1 atgtttggga agaggaagaa gcgggtggag atctccgcgc cgtccaactt cgagcaccgc
61 gtgcacacgg gcttcgacca gcacgagcag aagttcacgg ggctgccccg ccagtggcag
121 agcctgatcg aggagtcggc tcgccggccc aagcccctcg tcgaccccgc ctgcatcacc
181 tccatccagc ccggggcccc caagaccatc gtgcggggca gcaaaggtgc caaagatggg
241 gccctcacgc tgctgctgga cgagtttgag aacatgtcgg tgacacgctc caactccctg
301 cggagagaca gcccgccgcc gcccgcccgt gcccgccagg aaaatgggat gccagaggag
361 ccggccacca cggccagagg gggcccaggg aaggcaggca gccgaggccg gttcgccggt
421 cacagcgagg caggtggcgg cagtggtgac aggcgacggg cggggccaga gaagaggccc
481 aagtcttcca gggagggctc agggggtccc caggagtcct cccgggacaa acgccccctc
541 tccgggcctg atgtcggcac cccccagcct gctggtctgg ccagtggggc gaaactggca
601 gctggccggc cctttaacac ctacccgagg gctgacacgg accacccatc ccggggtgcc
661 cagggggagc ctcatgacgt ggcccctaac gggccatcag cggggggcct ggccatcccc
721 cagtcctcct cctcctcctc ccggcctccc acccgagccc gaggtgcccc cagccctgga
781 gtgctgggac cccacgcctc agagccccag ctggcccctc cagcctgcac ccccgccgcc
841 cctgctgttc ctgggccccc tggcccccgc tcaccacagc gggagccaca gcgagtatcc
901 catgagcagt tccgggctgc cctgcagctg gtggtggacc caggcgaccc ccgctcctac
961 ctggacaact tcatcaagat tggcgagggc tccacgggca tcgtgtgcat cgccaccgtg
1021 cgcagctcgg gcaagctggt ggccgtcaag aagatggacc tgcgcaagca gcagaggcgc
1081 gagctgctct tcaacgaggt ggtaatcatg agggactacc agcacgagaa tgtggtggag
1141 atgtacaaca gctacctggt gggggacgag ctctgggtgg tcatggagtt cctggaagga
1201 ggcgccctca ccgacatcgt cacccacacc aggatgaacg aggagcagat cgcagccgtg
1261 tgccttgcag tgctgcaggc cctgtcggtg ctccacgccc agggcgtcat ccaccgggac
1321 atcaagagcg actcgatcct gctgacccat gatggcaggg tgaagctgtc agactttggg
1381 ttctgcgccc aggtgagcaa ggaagtgccc cgaaggaagt cgctggtcgg cacgccctac
1441 tggatggccc cagagctcat ctcccgcctt ccctacgggc cagaggtaga catctggtcg
1501 ctggggataa tggtgattga gatggtggac ggagagcccc cctacttcaa cgagccaccc
1561 ctcaaagcca tgaagatgat tcgggacaac ctgccacccc gactgaagaa cctgcacaag
1621 gtgtcgccat ccctgaaggg cttcctggac cgcctgctgg tgcgagaccc tgcccagcgg
1681 gccacggcag ccgagctgct gaagcaccca ttcctggcca aggcagggcc gcctgccagc
1741 atcgtgcccc tcatgcgcca gaaccgcacc agatga



SEQ ID NO:8 Incyte template 067594.1

1 gagaccggga acatggcgct gggagcnctg tagcagctga gaaggggctg aggcaccgcc
61 gcttcgctga cagccggcca ccagatgttc atgcattcta gagaaagtgg aaaacttaga
121 agcctaatta atgactgtct tctggacctc tgagaccatg tttctagtgt tttccgtgga
181 atattatcag aaatacactg tggtgaaatg cttccacctc ttgctaaaat gaacactgag
241 gaaaaatgaa gaagactgac aagcaccagc gaaaagttgc agaatagaaa tagccacact
301 cctctggagt ctttaattca tccacagcca tcatataaag gttttggcat catgtttggg
361 aagaaaaaga aaagatbga aatatctggc ccgtccaact ttgaacacag ggttcatact
421 gggtttgatc cacaagagca gaagtttacc ggccttcccc agcagtggca cagcctgtta
481 gcagatacgg ccaacaggcc aaagcctatg gtggaccctt catgcatcac acccatccag
541 ctggctccta tgaagacatc gttagaggaa acaaaccctg c

SEQ ID NO:9 = gcatcatgtt tgggaagaaa -- primer sequence

SEQ ID NO:10 = a(g/c)ctc(a/t)gg(t/g)gccatcca(g/a)ta -- primer sequence
SEQ ID NO:11 Insert sequence from first PCR

1 gcatcatgtt tgggaagaaa aagaaaaaga ttgaaatatc tggcccgtcc aactttgaac
61 acagggttca tactgggttt gatccacaag agcagaagtt taccggcctt ccccagcagt
121 ggcacagcct gttagcagat acggccaaca ggccaaagcc tatggtggac ccttcatgca
181 tcacacccat ccagctggct cctatgaaga caatcgttag aggaaacaaa ccctgcaagg
241 aaacctccat caacggcctg ctagaggatt ttgacaacat ctcggtgact cgctccaact
301 ccctaaggaa agaaagccca cccaccccag atcagggagc ctccagccac ggtccaggcc
361 acgcggaaga aaatggcttc atcaccttct cccagtattc cagcgaatcc gatactactg
421 ctgactacac gaccgaaaag tacagggaga agagtctcta tggagatgat ctggatccgt
481 attatagagg cagccacgca gccaagcaaa atgggcacgt aatgaaaatg aagcacgggg
541 aggcctacta ttctgaggtg aagcctttga aatccgattt tgccagattt tctgccgatt
601 atcactcaca tttggactca ctgagcaaac caagtgaata cagtgacctc aagtgggagt
661 atcagagagc ctcgagtagc tcccctctgg attattcatt ccaattcaca ccttctagaa
721 ctgcagggac cagcgggtgc tccaaggaga gcctggcgta cagtgaaagt gaatggggac
781 ccagcctgga tgactatgac aggaggccaa agtcttcgta cctgaatcag acaagccctc
841 agcccaccat gcggcagagg tccaggtcag gctcgggact ccaggaaccg atgatgccat
901 ttggagcaag tgcatttaaa acccatcccc aaggacactc ctacaactcc tacacctacc
961 ctcgcttgtc cgagcccaca atgtgcattc caaaggtgga ttacgatcga gcacagatgg
1021 tcctcagccc tccactgtca gggtctgaca cctaccccag gggccctgcc aaactacctc
1081 aaagtcaaag caaatcgggc tattcctcaa gcagtcacca gtacccgtct gggtaccaca
1141 aagccacctt gtaccatcac ccctccctgc agagcagttc gcagtacatc tccacggctt
1201 cctacctgag ctccctcagc ctctcatcca gcacctaccc gccgcccagc tggggctcct
1261 cctccgacca gcagccctcc agggtgtccc atgaacagtt tcgggcggcc ctgcagctgg
1321 tggtcagccc aggagacccc agggaatact tggccaactt tatcaaaatc ggggaaggct
1381 caaccggcat cgtatgcatc ggcaccgaga aacacacagg gaaacaagtt gcagtgaaga
1441 aaatggacct ccggaagcaa cagagacgag aactgctttt caatgaggtc gtgatcatgc
1501 gggattacca ccatgacaat gtggttgaca tgtacagcag ctaccttgtc ggcgatgagc
1561 tctgggtggt catggagttt ctagaaggtg gtgccttgac agacattgtg actcacacca
1621 gaatgaatga agaacagata gctactgtct gcctgtcagt tctgagagct ctctcctacc
1681 ttcataacca aggagtgatt cacagggaca taaaaagtga ctccatcctc ctgacaagcg
1741 atggccggat aaagttgtct gattttggtt tctgtgctca agtttccaaa gaggtgccga
1801 agaggaaatc attggttggc actccctact ggatggcccc tgagct



SEQ ID NO:12 Deduced from sequences within GenBank ACCESSION AL031652

1 ataaagttgt ctgattttgg tttctgtgct caagtttcca aagaggtgcc gaagaggaaa
61 tcattggttg gcactcccta ctggatggcc cctgaggtga tttctaggct accttatggg
121 acagaggtgg acatctggtc cctcgggatc atggtgatag aaatgattga tggcgagccc
181 ccctacttca atgagcctcc cctccaggcg atgcggagga tccgggacag tttacctcca
241 agagtgaagg acctacacaa ggtttcttca gtgctccggg gattcctaga cttgatgttg
301 gtgagggagc cctctcagag agcaacagcc caggaactcc tcggacatcc attcttaaaa
361 ctagcaggtc caccgtcttg catcgtcccc ctcatgagac aatacaggca tcactga



SEQ ID NO:13 = gagaccggga acatggcgct -- primer sequence (sense)

SEQ ID NO:14 = tcagtgatgc ctgtattgtc tc -- primer sequence (antisense)

The invention is further illustrated by way of the following examples which are 
intended to elucidate the invention. These examples are not intended, nor are they to be 
construed, as limiting the scope of the invention. It will be clear that the invention may be
practiced otherwise than as particularly described herein. Numerous modifications and 
variations of the present invention are possible in view of the teachings herein and, 
therefore, are within the scope of the invention. 
Examples 1-3 and 6-7 presented below are actual examples, while examples 4, 5
and 8-10 are prophetic.

In the accompanying drawings: 

Figure 1: Kinase assays performed on PAK5 immunopurified from lysates of 293 
fibroblasts, showing that this kinase is able to phosphorylate generic substrates. 
1A. Kinase assay performed with myelin basic protein (MBP) as substrate.
Methodology was as described in Example 3. Lanes 1 and 2 correspond to
immunoprecipitates from cells transfected with vector encoding PAK5. Lanes 3 and 4
correspond to immunoprecipitates from cells transfected with empty control vector. In
lanes 1 and 3, MBP was present in the kinase reaction, while in lanes 2 and 4, MBP was
absent. Bars on the left-hand side of the figure indicate the approximate positions of
molecular weight markers, the sizes of which, from top to bottom, were 100, 80, 50, 35, and
28 kDa. Note that in the presence of MBP, PAK5 shows strong autophosphorylation.

1B. Kinase assay performed with histone H1 as substrate. Lanes 1 and 2
correspond to immunoprecipitates from cells transfected with vector encoding PAK5.
Lanes 3 and 4 correspond to immunoprecipitates from cells transfected with empty control
vector. In lanes 1 and 3, histone H1 was present; in lanes 2 and 4, histone H1 was absent.
Bars on the left-hand side of the figure indicate the approximate positions of molecular
weight markers, the sizes of which, from top to bottom, were 120, 80, 50, 35, 27, 20 and 7
kDa.

Figure 2: Multiple tissue northern blots showing distribution of PAK5 mRNA in
normal human tissues.

2A. Analysis in several distinct tissue types (Clontech human multiple tissue blot
#7760-1) shows that PAK5 (upper panel) is expressed selectively in brain. Lower panel
shows actin probe supplied with blot as a control.

2B. Northern analysis of sub-regions of normal human brain shows strong
expression in several regions, indicated above the lanes. Upper panel corresponds to
Clontech human Brain blot II, #7755-1 and lower panel corresponds to Clontech human
Brain blot IV, #7769-1.

EXAMPLES 
Example 1: Identification of PAK5

The four known human PAK coding sequences (PAKs 1-4; SEQ ID NO:4,
SEQ ID NO:5, SEQ ID NO:6, and SEQ ID NO:7, respectively) were extracted from the
GenBank database. The sequences were translated and aligned, the kinase domain regions
were removed from the alignment, and the remaining non-kinase region alignment was used
as input for hidden Markov model (HMM) profile generation using WiseTools software
(Birney et al., Nucleic Acids Res. 24, 2730-2739, 1996). The resulting HMM was used to
search the Incyte EST database. This search yielded an unknown partial sequence of
581 bp (Incyte template 067594.1, SEQ ID NO:8), for which nucleotide residues 352-581
showed significant homology to the first 230 nucleotides of the PAK4 CDS (i.e. SEQ ID NO:7
residues 1-230). We thus hypothesized that SEQ ID NO:8 represents a partial cDNA clone
for a novel member of the PAK family, corresponding to 351 bases of 5' UTR plus the
first 230 bases of coding sequence, with the start codon of the ORF at position 352-354.
The ATG at this position forms part of a Kozak consensus sequence for initiation of protein
translation (Kozak M, Cell vol. 44: 283-92, 1986). This novel sequence was designated
PAK5 based on the homology to PAK4.
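
The position arithmetic in this example can be checked directly against the listed sequence. The following is a minimal sketch in Python, assuming SEQ ID NO:8 exactly as transcribed in the sequence listing above (ambiguity characters `n` and `b` kept as listed); the helper name `find_start` and the choice of the first ten bases of the PAK4 CDS as a landmark are illustrative assumptions, not part of the original method.

```python
# Sketch only: SEQ ID NO:8 as transcribed in the sequence listing (581 nt).
# The ambiguity characters 'n' and 'b' are kept exactly as listed.
SEQ8_ROWS = [
    "gagaccggga acatggcgct gggagcnctg tagcagctga gaaggggctg aggcaccgcc",
    "gcttcgctga cagccggcca ccagatgttc atgcattcta gagaaagtgg aaaacttaga",
    "agcctaatta atgactgtct tctggacctc tgagaccatg tttctagtgt tttccgtgga",
    "atattatcag aaatacactg tggtgaaatg cttccacctc ttgctaaaat gaacactgag",
    "gaaaaatgaa gaagactgac aagcaccagc gaaaagttgc agaatagaaa tagccacact",
    "cctctggagt ctttaattca tccacagcca tcatataaag gttttggcat catgtttggg",
    "aagaaaaaga aaaagatbga aatatctggc ccgtccaact ttgaacacag ggttcatact",
    "gggtttgatc cacaagagca gaagtttacc ggccttcccc agcagtggca cagcctgtta",
    "gcagatacgg ccaacaggcc aaagcctatg gtggaccctt catgcatcac acccatccag",
    "ctggctccta tgaagacatc gttagaggaa acaaaccctg c",
]
SEQ8 = "".join(SEQ8_ROWS).replace(" ", "")

# First ten bases of the PAK4 CDS (SEQ ID NO:7), used only as a landmark.
PAK4_START = "atgtttggga"


def find_start(seq, landmark):
    """Return the 1-based position of the first occurrence of landmark (0 if absent)."""
    return seq.find(landmark) + 1


start = find_start(SEQ8, PAK4_START)   # 1-based position of the 'a' of ATG
utr_len = start - 1                    # bases upstream of the ATG
cds_len = len(SEQ8) - utr_len          # bases from the ATG to the 3' end
minus3 = SEQ8[start - 4]               # base at Kozak position -3

print(len(SEQ8), start, utr_len, cds_len, minus3)
```

With the sequence as listed, the landmark is found after 351 upstream bases, leaving 230 bases of coding sequence, and the base at position -3 relative to the ATG is a purine.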

Example 2: Cloning of pak5 cDNA 

A sense PCR primer (SEQ ID NO:9) was designed which specifically
matches SEQ ID NO:8 at position 347-366, very near to the 5' end of the inferred CDS, and
thus specifically amplifies PAK5 but not the other known PAK family sequences. An
antisense primer near the 3' end of the inferred CDS was designed by aligning the
conserved kinase domains located in the 3' region of all PAK protein family members
using the Align Program within the VECTOR NTI Suite software (InforMax, Inc.,
Bethesda MD), and choosing by eye a suitable degenerate primer (SEQ ID NO:10) for
amplification of the putative PAK5 sequence. This primer has highest homology towards
the antisense sequence in position 1438-1457 of the human PAK4 sequence (SEQ ID
NO:7). Thus the primer pair defined in SEQ ID NO:9 and SEQ ID NO:10, when used in a
PCR reaction under suitable conditions and with suitable template cDNA, should
specifically amplify a cDNA fragment encoding most of the novel kinase coding sequence.
Standard protocols were used for PCR (see Sambrook, 1987). An MJ Research PTC-225
PCR cycler was used, applying 30 cycles of the following conditions: 96°C for 15 seconds,
54°C for 15 seconds, 72°C for 2 minutes. cDNA obtained by reverse transcription of human
fetal brain mRNA (Clontech) was used as the template in the PCR reaction. The resulting
PCR products were subcloned into the pCR2.1 "TA" vector (Invitrogen) by the TA cloning
approach as per kit protocols. The resulting ligation reactions were used to transform
INVαF' E. coli competent cells (Invitrogen). Single, isolated transformed E. coli colonies
were grown in selective media (LB broth, ampicillin) overnight at 37°C and subsequently
used to prepare plasmid DNA (Qiagen Plasmid DNA preparation kit).
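
The relationship between the degenerate antisense primer and the PAK4 sequence can be illustrated with a short check. This is a sketch in Python under stated assumptions: the `(x/y)` alternatives in SEQ ID NO:10 are rewritten as IUPAC ambiguity codes, the PAK4 fragment is taken from SEQ ID NO:7 as transcribed in the sequence listing, and the function names are illustrative only.

```python
# IUPAC codes for the two-base alternatives written as (x/y) in SEQ ID NO:10.
IUPAC = {
    "a": "a", "c": "c", "g": "g", "t": "t",
    "s": "cg", "w": "at", "k": "gt", "r": "ag",
}
COMPLEMENT = str.maketrans("acgt", "tgca")

# SEQ ID NO:10, a(g/c)ctc(a/t)gg(t/g)gccatcca(g/a)ta, in IUPAC notation.
PRIMER = "asctcwggkgccatccarta"

# PAK4 CDS (SEQ ID NO:7) positions 1438-1457: the last three bases of the
# row starting at 1381 followed by the first 17 bases of the row at 1441.
PAK4_1438_1457 = "tac" + "tggatggccccagagct"


def reverse_complement(seq):
    """Reverse complement of an unambiguous lower-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def matches(primer, target):
    """True if every target base is allowed by the primer's IUPAC code."""
    return len(primer) == len(target) and all(
        base in IUPAC[code] for code, base in zip(primer, target)
    )


def degeneracy(primer):
    """Number of distinct oligonucleotides represented by the primer."""
    total = 1
    for code in primer:
        total *= len(IUPAC[code])
    return total


antisense = reverse_complement(PAK4_1438_1457)
print(antisense, matches(PRIMER, antisense), degeneracy(PRIMER))
```

Under these assumptions the PAK4 antisense strand at 1438-1457 is one of the 16 sequences covered by the degenerate primer, which is consistent with the statement that the primer has its highest homology to that region.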

Inserts of two independent clones were sequenced directly using an ABI377
fluorescence-based sequencer (Perkin Elmer/Applied Biosystems Division, PE/ABD,
Foster City, CA) and the ABI PRISM Ready Dye-Deoxy Terminator kit with Taq FS
polymerase. Each ABI cycle sequencing reaction contained about 0.5 µg of plasmid DNA.
Cycle-sequencing was performed using an initial denaturation at 98°C for 1 min, followed
by 50 cycles: 98°C for 30 sec, annealing at 50°C for 30 sec, and extension at 60°C for 4
min. Temperature cycles and times were controlled by a Perkin-Elmer 9600 thermocycler.
Extension products were purified using Centriflex gel filtration columns (Advanced
Genetic Technologies Corp., Gaithersburg, MD). Each reaction product was loaded by
pipette onto the column, which was then centrifuged in a swinging bucket centrifuge
(Sorvall model RT6000B tabletop centrifuge) at 1500 x g for 4 min at room temperature.
Column-purified samples were dried under vacuum for about 40 min and then dissolved in
5 µl of a DNA loading solution (83% deionized formamide, 8.3 mM EDTA, and 1.6 mg/ml
Blue Dextran). The samples were then heated to 90°C for three min and loaded into the gel
sample wells for sequence analysis. Sequence analysis was performed by importing
ABI377 files into the Sequencher program (Gene Codes, Ann Arbor, MI). Generally,
sequence reads of 700 bp were obtained. Potential sequencing errors were minimized by
obtaining sequence information from both DNA strands and by re-sequencing difficult
areas using primers at different locations until all sequencing ambiguities were removed.
This resulted in the sequence reported in SEQ ID NO:11. SEQ ID NO:11 was used as a
query sequence against the GenBank database. This database was searched for regions of
similarity using Gapped BLAST. This resulted in identification of a template sequence,
with GenBank ACCESSION # AL031652, having a statistically significant overlapping
homology to the query sequence. The sequence identified by template GenBank
ACCESSION # AL031652 contains, amongst other, unrelated sequences, a deduced cDNA
which encodes the predicted 417 3' terminal residues, including the stop codon (SEQ ID
NO:12), of a predicted PAK1-like kinase. The overlapping similarity extended from SEQ
ID NO:11 residues 1749 to 1846, compared to template (SEQ ID NO:12) residues 1 to 98,
aligning with an overall DNA sequence identity of 98.98%. SEQ ID NO:11 thus contained
1748 bp of sequence 5' to the overlap, not present in the public SEQ ID NO:12, while SEQ
ID NO:12 contained 319 bp 3' to the overlap, not present in SEQ ID NO:11. Combining
this information, it was thus possible to infer the full-length coding sequence for PAK5 by
assembling the partial coding sequences contained within SEQ ID NO:8 (i.e. Incyte
template 067594.1), SEQ ID NO:11 derived by PCR cloning/sequencing, and SEQ ID
NO:12, the deduced partial coding sequence for a "PAK1-like serine threonine kinase"
from genomic clone AL031652. Without PCR cloning and sequencing of SEQ ID NO:11,
which contains most of the PAK5 sequence, and which was previously unknown, including
most of the kinase catalytic domain, it would not have been possible to assign SEQ ID
NO:8 and SEQ ID NO:12 as belonging to the same gene. Furthermore, since SEQ ID
NO:12 is not an expressed sequence, but is deduced from genomic DNA, with many
intervening non-coding sequences (introns), it would not have been possible with the
existing information to deduce that SEQ ID NO:12 is, in fact, expressed as such. When
the sequence of SEQ ID NO:11 is assembled together with non-overlapping sequences
contained within SEQ ID NO:8 (i.e. Incyte template 067594.1), and those contained within
SEQ ID NO:12 (i.e. a deduced partial coding sequence from genomic clone GenBank
Accession AL031652), the full-length sequence shown in SEQ ID NO:1 is obtained.
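
The overlap figures quoted above can be reproduced with a simple ungapped comparison. The sketch below, in Python, assumes the two overlapping segments as transcribed from the sequence listing (SEQ ID NO:11 residues 1749-1846 and SEQ ID NO:12 residues 1-98, with extraction artifacts removed); it is an illustration of the arithmetic, not the Gapped BLAST search that was actually used.

```python
# Overlapping segments as transcribed from the sequence listing; this is an
# ungapped position-by-position comparison, not a BLAST alignment.
SEQ11_1749_1846 = "".join([
    "at", "aaagttgtct", "gattttggtt", "tctgtgctca", "agtttccaaa",
    "gaggtgccga", "agaggaaatc", "attggttggc", "actccctact",
    "ggatggcccc", "tgagct",
])
SEQ12_1_98 = "".join([
    "ataaagttgt", "ctgattttgg", "tttctgtgct", "caagtttcca", "aagaggtgcc",
    "gaagaggaaa", "tcattggttg", "gcactcccta", "ctggatggcc", "cctgaggt",
])


def percent_identity(a, b):
    """Percentage of identical positions between two equal-length strings."""
    if len(a) != len(b):
        raise ValueError("ungapped comparison needs equal lengths")
    same = sum(x == y for x, y in zip(a, b))
    return 100.0 * same / len(a)


# 1-based positions within the overlap where the two segments differ.
mismatches = [
    i + 1
    for i, (x, y) in enumerate(zip(SEQ11_1749_1846, SEQ12_1_98))
    if x != y
]
identity = percent_identity(SEQ11_1749_1846, SEQ12_1_98)

# Span covered by SEQ ID NO:11 plus the non-overlapping part of SEQ ID NO:12.
combined_span = 1748 + 98 + 319

print(len(SEQ12_1_98), mismatches, round(identity, 2), combined_span)
```

Under these assumptions a single mismatch in 98 positions gives 97/98, or 98.98% identity, and the 98 bp overlap plus 319 bp of 3' sequence accounts for all 417 residues of SEQ ID NO:12.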

In order to formally demonstrate that SEQ ID NO:8, SEQ ID NO:11 and
SEQ ID NO:12 do, in fact, represent contiguous stretches of the same gene, the full-length
PAK5 cDNA was amplified from fetal brain cDNA by PCR using the sense primer
described in SEQ ID NO:13 and antisense primer described in SEQ ID NO:14, which
respectively cover the 5' (sense) and 3' (antisense) extremities of the sequence described in
SEQ ID NO:1. Several independent clones from independent PCR reactions were
sequenced and were found to contain the expected fragment, corresponding to nucleotide
residues of SEQ ID NO:1. This sequence contains a 2157 bp (SEQ ID NO:2) major open
reading frame (ORF) with a Kozak consensus sequence at the initiation ATG codon.
Translation of the ORF resulted in a 719 amino acid protein sequence (SEQ ID NO:3)
which has homology with the known PAKs 1-4, and, in particular, contains the conserved
GBD/CRIB region and kinase subdomains found in the PAK family of serine/threonine
protein kinases (discussed above). The PAK5 protein has a predicted molecular weight of
80759.01 Dalton, an isoelectric point of 7.72, and a net charge of 3.09 at pH 7.0.

Example 3: Assay to Identify Level of PAK5 Kinase Activity 

Human 293 cells are transfected with the mammalian expression vector pcDNA3.1
(Invitrogen) encoding full-length PAK5, as described in Example 7, or with control (empty)
vector, and lysed after 24 hours using M-Per mammalian cell lysis buffer (Pierce). 500 µg
of solubilised cellular proteins are diluted to a final volume of 500 µl with M-Per buffer,
and PAK5 is immunoprecipitated by mixing this lysate with 10 µl Protein A Sepharose
(Sigma Chemical Company) and 20 µl of a polyclonal anti-PAK5 antiserum. Reactions
are performed at 4°C for 2 hours, after which Protein A-Sepharose pellets containing
immunoprecipitates are washed twice by centrifugation (10,000 x g, 5 minutes) with 500 µl
M-Per buffer, and twice with 500 µl of modified kinase reaction buffer, containing 20 mM
HEPES (pH 7.5), 10 mM magnesium acetate, 1 mM DTT, 0.01 mM sodium orthovanadate.
After washing, the 10 µl Protein A-Sepharose pellets are drained of excess buffer and used
for kinase reactions, as described below.

The anti-PAK5 antiserum used for immunoprecipitation is produced by
immunisation of rabbits (NZW, Charles River) with a recombinant fusion protein
containing GST fused with a fragment of PAK5 corresponding to the PAK5 sequence
residues 106 to 404 of SEQ ID NO:3. Immunisation of rabbits and production of antiserum
are performed as described in Using Antibodies: A Laboratory Manual by Ed Harlow,
David Lane, Cold Spring Harbor Laboratory Press; ISBN: 0879695439.

Protein A-Sepharose pellets containing immunoprecipitates obtained as described
above in a final volume of 10 µl are mixed with 20 µg myelin basic protein (MBP), or with
20 µg histone H1 (H1), in 10 µl of a 3X kinase reaction buffer (KRB) containing: 60 mM
HEPES (pH 7.5), 30 mM magnesium acetate, 0.15 mM ATP, 3 mM DTT, 0.03 mM
sodium orthovanadate. The reaction is started by the addition of 5 µCi [γ-32P]ATP (10 µl).
Samples are incubated for 5 minutes at 30°C and the reaction is stopped by addition of 4X
Laemmli sample buffer. Proteins are separated on Tris/glycine SDS gels (pre-made,
obtained from Bio-Rad, Richmond CA), stained with Coomassie blue, dried, exposed to
phosphoimaging plates (K plates, Bio-Rad, Richmond CA), and read on a phosphoimager
(Bio-Rad, Richmond CA).
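
The nominal final concentrations in the 30 µl kinase reaction follow from diluting the 3X KRB threefold. The following Python sketch works through that arithmetic; it assumes that only the 10 µl of 3X KRB contributes buffer components (carry-over from the drained pellet and from the labelled ATP solution is ignored), and the variable and function names are illustrative.

```python
# 3X kinase reaction buffer (KRB) as given in the text, concentrations in mM.
KRB_3X_MM = {
    "HEPES": 60.0,
    "magnesium acetate": 30.0,
    "ATP": 0.15,
    "DTT": 3.0,
    "sodium orthovanadate": 0.03,
}

# Volumes combined in one reaction, in microlitres.
VOLUMES_UL = {
    "immunoprecipitate": 10.0,
    "3X KRB with substrate": 10.0,
    "labelled ATP": 10.0,
}


def final_concentrations(stock_mm, stock_volume_ul, total_volume_ul):
    """Dilute every stock component by stock_volume / total_volume."""
    factor = stock_volume_ul / total_volume_ul
    return {name: conc * factor for name, conc in stock_mm.items()}


total_ul = sum(VOLUMES_UL.values())
final_mm = final_concentrations(
    KRB_3X_MM, VOLUMES_UL["3X KRB with substrate"], total_ul
)
substrate_ug_per_ul = 20.0 / total_ul  # 20 ug MBP or histone H1 per reaction

for name, conc in final_mm.items():
    print(f"{name}: {conc:.3g} mM")
print(f"substrate: {substrate_ug_per_ul:.2f} ug/ul in {total_ul:.0f} ul")
```

Under this assumption the 1X reaction contains 20 mM HEPES, 10 mM magnesium acetate, 1 mM DTT and 0.01 mM sodium orthovanadate, matching the modified kinase reaction buffer used for the final washes, plus 0.05 mM unlabelled ATP.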

Figure 1 shows the results of such analysis, from which it is clear that PAK5 is able
to phosphorylate generic kinase substrates such as myelin basic protein and histone H1.

Example 4: High-Throughput Screening Assay to Identify Compounds that Modulate 
PAK5 Kinase Activity 

High throughput screening for modulator compounds can be performed
using MBP coated 96-well FlashPlates® (NEN Life Science Products). Kinase reaction
buffer (3X kinase reaction buffer (KRB) containing: 60 mM HEPES (pH 7.5), 30 mM
magnesium acetate, 0.15 mM ATP, 3 mM DTT, 0.03 mM sodium orthovanadate), 0.25
µCi [γ-33P]-ATP, and kinase at a concentration no greater than 1 µg/ml (determined by
titration of individual enzyme preparations for a concentration that allows kinetic
determinations over a 1 hour time course of the kinase) are added to each well and
incubated for 1 hour at 30°C in the presence or absence of 10 µM test compound. Total
reaction volume is 100 µl. Following incubation, the reaction mixture is aspirated and the
wells rinsed twice with 200 µl PBS. Incorporation of radiolabeled phosphate is determined
by scintillation counting (Packard Instrument Co. TopCount, 12-detector, 96-well
microplate scintillation counter and luminescence counter, model B991200). Compounds
which inhibit kinase activity >50 percent at 10 µM are indicated by a >50% reduction in
scintillation counts. Specificity and selectivity are determined by titration of inhibitory
compounds to determine the IC50 (or other standard quantitation well known in the art for
comparison) and by the substitution of other kinases in the assay. For example,
determination of the relative inhibitory activity of the kinase in comparison to recombinant
PAK4 kinase, expressed and isolated in a similar manner and assayed under similar
conditions, provides selectivity data.

Example 5: High-Throughput Screening Assay to Identify Compounds that Modulate 
PAK5 Activity 

Test compounds are prepared in advance from 2.5 mg/ml stock solutions in
DMSO by diluting 1:10 in distilled water, followed by an additional 1:10 dilution in water.
10 µl of the 1:100 dilution solutions (25 µg/ml in 1% DMSO) are prepared in 96 well
Microlite 1 plates (Dynex) and plates are stored at -20°C until the evening prior to the start
of the assay.
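
The dilution scheme above can be checked with a few lines of arithmetic. This Python sketch assumes the reaction volume used later in this example (10 µl compound, 20 µl ATP mix and 30 µl enzyme, before the stop solution is added); the function name `serial_dilute` is illustrative.

```python
def serial_dilute(start, factors):
    """Apply successive dilution factors (e.g. 10 for a 1:10 step)."""
    value = start
    for factor in factors:
        value /= factor
    return value


stock_ug_per_ml = 2.5 * 1000  # 2.5 mg/ml stock in DMSO, expressed in ug/ml
stock_dmso_percent = 100.0    # stock is in neat DMSO

# Two successive 1:10 dilutions in water give the 1:100 plate solution.
plate_ug_per_ml = serial_dilute(stock_ug_per_ml, [10, 10])
plate_dmso_percent = serial_dilute(stock_dmso_percent, [10, 10])

# Assumed reaction volume: 10 ul compound + 20 ul ATP mix + 30 ul enzyme.
compound_ul = 10.0
reaction_ul = compound_ul + 20.0 + 30.0
assay_ug_per_ml = plate_ug_per_ml * compound_ul / reaction_ul
assay_dmso_percent = plate_dmso_percent * compound_ul / reaction_ul

print(plate_ug_per_ml, plate_dmso_percent)
print(round(assay_ug_per_ml, 2), round(assay_dmso_percent, 3))
```

The plate solution works out to 25 µg/ml in 1% DMSO, as stated, and under the assumed 60 µl reaction volume the compound is tested at roughly 4.2 µg/ml in about 0.17% DMSO.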
The signal from wells containing test compounds is compared to zero
inhibition wells containing 10 µl of 1% (v/v) DMSO solution in MilliQ water, and to 100%
inhibition wells containing 10 µl of 200 mM EDTA in 1% DMSO solution in MilliQ water.
50% inhibition wells contain a reference compound at a concentration known to provide
approximately 50% inhibition in 1% (v/v) DMSO solution in MilliQ water.

Assay components 

(1) recombinant PAK5 kinase (expressed in E. coli or eukaryotic cells as described herein)
or a lysate of a prokaryotic or eukaryotic cell expressing recombinant enzyme, or the
natural enzyme partially purified from a human cell line.

(2) [γ-33P]-adenosine triphosphate in 3X KRB

(3) myelin basic protein linked to the surface of PVT SPA (Scintillation Proximity Assay)
beads (purchased from Amersham Pharmacia Biotech) by an antibody-protein A or other
appropriate method.

To Microlite 1 plates containing 10 µl of test compound, which were left on the bench
overnight to reach room temperature, 20 µl of ATP/33P-ATP is added, immediately followed
by 30 µl of Enzyme, using two Multidrops. The plates are stacked (with an empty plate on
top of each stack to minimize evaporation from the top plate) and left at room temperature
for 105 minutes. 150 µl of "Stop Solution" containing anti-beads antibody and EDTA is
added using a Multidrop. The plates are sealed with plate sealers and left on the bench
overnight, surrounded by perspex screens. The plates are then centrifuged (Heraeus
Megafuge 3.0R) at 2500 rpm for 5 minutes and counted on a Topcount instrument
(isotope: 33P; counting time: 20 seconds/well). A threshold for inhibition is set, e.g., 60%
inhibition of scintillation signal. Compounds reaching the inhibition threshold are scored
as active.
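
Scoring against the zero-inhibition and 100%-inhibition control wells can be sketched as follows. This is a minimal Python illustration, not the analysis software actually used: the normalisation formula is the usual two-control one, and all counts, well names and function names below are hypothetical values chosen for the example.

```python
from statistics import mean


def percent_inhibition(signal, zero_ctrl, full_ctrl):
    """Normalise a well's counts between the 0% and 100% inhibition controls."""
    window = zero_ctrl - full_ctrl
    if window <= 0:
        raise ValueError("zero-inhibition control must exceed 100% control")
    return 100.0 * (zero_ctrl - signal) / window


def score_plate(signals, zero_wells, full_wells, threshold=60.0):
    """Return {well: (percent inhibition, active?)} for each test well."""
    zero_ctrl = mean(zero_wells)   # DMSO-only wells
    full_ctrl = mean(full_wells)   # EDTA wells
    results = {}
    for well, counts in signals.items():
        inhibition = percent_inhibition(counts, zero_ctrl, full_ctrl)
        results[well] = (inhibition, inhibition >= threshold)
    return results


# Hypothetical scintillation counts, for illustration only.
zero_wells = [10000.0, 10200.0, 9800.0]
full_wells = [500.0, 520.0, 480.0]
test_wells = {"A1": 9500.0, "A2": 4000.0, "A3": 600.0}

for well, (inhibition, active) in score_plate(
    test_wells, zero_wells, full_wells
).items():
    print(well, round(inhibition, 1), "active" if active else "inactive")
```

With a 60% threshold, only wells whose counts fall at least 60% of the way from the DMSO control towards the EDTA control are scored as active; the reference wells expected to give about 50% inhibition provide a check that the assay window behaves as intended.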

Example 6: Northern Blot Analysis
Northern blots were performed to examine the expression of mRNA. Sense and
antisense primers were selected based on the PAK5 sequence set forth as SEQ ID NO:2. A
fragment from positions 318 - 1212 of SEQ ID NO:2 was amplified and used as a probe.

Multiple human tissue northern blots from Clontech (Human MTN #7760-1) were
hybridized with the probe. Pre-hybridization was carried out at 42°C for 4 hours in 5x SSC,
1x Denhardt's reagent, 0.1% SDS, 50% formamide, 250 mg/ml salmon sperm DNA.
Hybridization was performed overnight at 42°C in the same mixture with the addition of
about 1.5x10^6 cpm/ml of labelled probe.

The probe was labelled with γ-32P-dCTP by Rediprime DNA labelling system
(Amersham Pharmacia), purified on Nick Column (Amersham Pharmacia) and added to
the hybridization solution. The filters were washed several times at 42°C in 0.2x SSC,
0.1% SDS. Filters were exposed to phosphoimaging plates (K plates, Bio-Rad, Richmond
CA), and read on a phosphoimager (Bio-Rad, Richmond CA).

The results of such analysis are shown in Figure 2A. Using the PAK5 probe, a single
approximately 5 kb mRNA is detected in brain (Figure 2A, upper panel). Equal loading of
all the lanes was verified by filter hybridisation with a human actin probe (Figure 2A,
lower panel).

In order to further investigate the distribution of PAK5 in normal human brain, the
Northern blotting procedure described above was repeated using Clontech brain sub-region
blots (Clontech Human MTN Brain II, #7755-1, and Human MTN Brain IV, #7769-1).
The result of this analysis is shown in Figure 2B. PAK5 is particularly strongly expressed
in cerebellum, cerebral cortex, occipital pole and frontal lobe, but can be readily detected in
most other regions of the brain.

Example 7: Expression of PAK5 in Mammalian Cells

1. Expression of PAK5 in 293 cells

For expression of PAK5 in 293 cells (transformed human primary embryonic
kidney cells), a plasmid bearing the relevant PAK5 coding sequence is
prepared, using vector pcDNA3.1 myc-his (Invitrogen). The plasmid contains nucleotides
1 through 2157 of SEQ ID NO:2. Vector pcDNA3.1 contains the c-myc epitope for
detection of the recombinant protein with the anti-myc antibody, a C-terminal polyhistidine
for purification with nickel chelate chromatography, and a neomycin resistance gene for
selection of stable transfectants. The forward primer for amplification of PAK5 cDNA is
selected using methods available to one of skill in the art based on the sequence of SEQ ID
NO:2 and includes a 5' extension of 19 nucleotides to introduce the NotI cloning site and
22 nucleotides matching the PAK5 sequence. The reverse primer is selected using methods
available to one of skill in the art based on the sequence of SEQ ID NO:2 and
contains a 5' extension of 8 nucleotides to introduce a BamHI restriction site for cloning
and 17 nucleotides corresponding to the reverse complement of the PAK5 sequence. The
PCR is run with an annealing temperature of 55°C. The PCR product is gel purified
and cloned into the NotI-BamHI sites of the vector.

The DNA is purified using Qiagen chromatography columns and transfected
into 293 cells using SuperFect transfection reagent (Qiagen). Transiently transfected
cells are tested for expression after 24 hours of transfection, using Western blots probed
with anti-His and anti-PAK5 peptide antibodies. Permanently transfected cells are selected
with G418 and propagated. Production of the recombinant protein is detected from cells by
Western blots probed with anti-His, anti-Myc or anti-PAK5 peptide antibodies.

For expression of a fragment lacking the kinase region of PAK5, a plasmid
comprising nucleotides 1-1347 is generated, following the procedure set forth above.

For expression of a fragment containing the kinase domain of PAK5, a
plasmid comprising nucleotides 1318-2157 is generated.

2. Expression of PAK5 in COS cells

For expression of PAK5 in COS7 cells, a polynucleotide molecule having
the sequence given in SEQ ID NO:2 is cloned into vector pSecTag2A. Vector pSecTag2A
contains the murine Igκ chain leader sequence for secretion, the c-myc epitope for
detection of the recombinant protein with the anti-myc antibody, a C-terminal polyhistidine
for purification with nickel chelate chromatography, and a Zeocin resistance gene for
selection of stable transfectants.

The forward primer for amplification of PAK5 cDNA is selected using
methods available to one of skill in the art based on the sequence of SEQ ID NO: 2 and
including a 5' extension of 19 nucleotides to introduce the HindIII restriction site for
cloning and 22 nucleotides matching the PAK5 sequence given in SEQ ID NO: 2. The
reverse primer is selected using methods available to one of skill in the art based on the
sequence of SEQ ID NO: 2 and contains a 5' extension of 8 nucleotides to introduce
a BamHI restriction site for cloning and 17 nucleotides corresponding to the reverse
complement of the PAK5 sequence given in SEQ ID NO: 2. The PCR consists of an initial
denaturation step of 5 min at 95°C, 30 cycles of 30 sec denaturation at 95°C, 30 sec
annealing at 58°C and 30 sec extension at 72°C, followed by 5 min extension at 72°C. The
PCR product is gel purified and ligated into the XbaI and SalI sites of vector p3-CI. This
construct is transformed into E. coli cells for amplification and DNA purification. The
DNA is purified with Qiagen chromatography columns and transfected into COS7 cells
using Lipofectamine reagent from BRL, following the manufacturer's protocols. Forty-eight
and 72 hours after transfection, the media and the cells are tested for recombinant
protein expression.
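
The cycling program stated above can be written down as data, which also makes the total programmed hold time easy to check. This is a minimal sketch; the class and variable names are hypothetical, and ramp times between temperatures are not included.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Step:
    name: str
    temp_c: int
    seconds: int


# Values taken from the PCR conditions described in the text.
INITIAL = Step("initial denaturation", 95, 5 * 60)
CYCLE = (
    Step("denaturation", 95, 30),
    Step("annealing", 58, 30),
    Step("extension", 72, 30),
)
FINAL = Step("final extension", 72, 5 * 60)
N_CYCLES = 30


def total_hold_seconds() -> int:
    """Sum of programmed hold times, excluding instrument ramping."""
    per_cycle = sum(step.seconds for step in CYCLE)
    return INITIAL.seconds + N_CYCLES * per_cycle + FINAL.seconds


print(total_hold_seconds() / 60)  # 55.0 minutes
```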

PAK5 expressed from a COS cell culture can be purified by concentrating
the cell-growth media to about 10 mg of protein/ml, and purifying the protein by, for
example, chromatography. Purified PAK5 is concentrated to 0.5 mg/ml in an Amicon
concentrator fitted with a YM-10 membrane and stored at -80°C.

Example 8: Expression of PAK5 in Insect Cells

For expression of PAK5 in a baculovirus system, a polynucleotide molecule
having the sequence given as SEQ ID NO:2 is amplified by PCR. The forward primer
consists of a 5' extension which adds the NotI cloning site, followed by 22 nucleotides
which correspond to nucleotides of the sequence given in SEQ ID NO:2. The reverse
primer comprises a 5' extension which introduces the BamHI cloning site, followed
by 17 nucleotides which correspond to the reverse complement of nucleotides
given in SEQ ID NO:2.

The PCR product is gel purified, digested with NdeI and KpnI, and cloned
into the corresponding sites of vector pAcHLT-A (Pharmingen, San Diego, CA). The
pAcHLT expression vector contains the strong polyhedrin promoter of the Autographa
californica nuclear polyhedrosis virus (AcMNPV), and a 6XHis tag upstream from the
multiple cloning site. A protein kinase site for phosphorylation and a thrombin site for
excision of the recombinant protein precede the multiple cloning site. Of
course, many other baculovirus vectors could be used in place of pAcHLT-A, such as
pAc373, pVL941 and pAcIM1. Other suitable vectors for the expression of PAK5
polypeptides can be used, provided that the vector construct includes appropriately located
signals for transcription, translation, and trafficking, such as an in-frame AUG and a signal
peptide, as required. Such vectors are described in Luckow et al., Virology 170:31-39,
among others.

The virus is grown and isolated using standard baculovirus expression
methods, such as those described in Summers et al. (A Manual of Methods for Baculovirus
Vectors and Insect Cell Culture Procedures, Texas Agricultural Experimental Station
Bulletin No. 1555 (1987)).

In a preferred embodiment, pAcHLT-A containing the PAK5 gene is
introduced into baculovirus using the "BaculoGold" transfection kit (Pharmingen, San
Diego, CA) using methods established by the manufacturer. Individual virus isolates are
analyzed for protein production by radiolabeling infected cells with 35S-methionine at 24
hours post infection. Infected cells are harvested at 48 hours post infection, and the labeled
proteins are visualized by SDS-PAGE. Viruses exhibiting high expression levels can be
isolated and used for scaled up expression.

For expression of the PAK5 polypeptide in Sf9 cells, a polynucleotide
molecule having the sequence given in SEQ ID NO:2 is amplified by PCR using the
primers and methods described above for baculovirus expression. The PAK5 cDNA is
cloned into vector pAcHLT-A (Pharmingen) for expression in Sf9 insect cells. The insert is
cloned into the NotI and BamHI sites, after elimination of an internal NdeI site (using the
same primers described above for expression in baculovirus). DNA is purified with Qiagen
chromatography columns and expressed in Sf9 cells. In preliminary Western blot experiments,
non-purified plaques are tested for the presence of a recombinant protein of the
expected size that reacts with the PAK5-specific antibody. These results are confirmed
after further purification and expression optimization in HiG5 cells.

Example 9: Interaction Trap/Two-Hybrid System

In order to assay for PAK5-interacting proteins, the interaction trap/two-hybrid
library screening method can be used. This assay was first described in Fields et
al., Nature, 1989, 340, 245, which is incorporated herein by reference in its entirety. A
protocol is published in Current Protocols in Molecular Biology 1999, John Wiley & Sons,
NY and Ausubel, F.M. et al. 1992, Short protocols in molecular biology, Fourth edition,
Greene and Wiley-Interscience, NY, which is incorporated herein by reference in its
entirety. Kits are available from Clontech, Palo Alto, CA (Matchmaker Two-Hybrid
System 3).

A fusion of the nucleotide sequences encoding all or part of PAK5 and the
yeast transcription factor GAL4 DNA-binding domain (DNA-BD) is constructed in an
appropriate plasmid (e.g., pGBKT7) using standard subcloning techniques. Similarly, a
GAL4 activation domain (AD) fusion library is constructed in a second plasmid (e.g., pGADT7)
from cDNA of potential PAK5-binding proteins (for protocols on forming cDNA libraries,
see Sambrook et al. 1989, Molecular cloning: a laboratory manual, second edition, Cold
Spring Harbor Press, Cold Spring Harbor, NY, which is incorporated herein by reference
in its entirety). The DNA-BD/PAK5 fusion construct is verified by sequencing, and tested
for autonomous reporter gene activation and cell toxicity, both of which would prevent a
successful two-hybrid analysis. Similar controls are performed with the AD/library fusion
construct to ensure expression in host cells and lack of transcriptional activity. Yeast cells
are transformed (ca. 10^5 transformants/µg DNA) with both the PAK5 and library fusion
plasmids according to standard procedure (Ausubel, et al., 1992, Short protocols in
molecular biology, Fourth edition, Greene and Wiley-Interscience, NY, which is
incorporated herein by reference in its entirety). In vivo binding of DNA-BD/PAK5 with
AD/library proteins results in transcription of specific yeast plasmid reporter genes (e.g.,
lacZ, HIS3, ADE2, LEU2). Yeast cells are plated on nutrient-deficient media to screen for
expression of reporter genes. Colonies are dually assayed for β-galactosidase activity upon
growth in Xgal (5-bromo-4-chloro-3-indolyl-β-D-galactoside) supplemented media (a filter
assay for β-galactosidase activity is described in Breeden et al., Cold Spring Harb. Symp.
Quant. Biol., 1985, 50, 643, which is incorporated herein by reference in its entirety).
Positive AD-library plasmids are rescued from transformants and reintroduced into the
original yeast strain as well as other strains containing unrelated DNA-BD fusion proteins
to confirm specific PAK5/library protein interactions. Insert DNA is sequenced to verify
the presence of an open reading frame fused to GAL4 AD and to determine the identity of
the PAK5-interacting protein.
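
The confirmation step described above (a rescued library plasmid should activate the reporter with the PAK5 bait but not with unrelated DNA-BD fusions) amounts to a simple filter over retest results. The sketch below is illustrative only; the clone names, bait names, and function name are hypothetical.

```python
def specific_interactors(results: dict[str, dict[str, bool]],
                         bait: str = "PAK5") -> list[str]:
    """Return clones whose reporter is activated only with the chosen bait.

    `results` maps a clone name to {bait name: reporter activated?}.
    """
    hits = []
    for clone, by_bait in results.items():
        with_bait = by_bait.get(bait, False)
        with_others = any(active for name, active in by_bait.items()
                          if name != bait)
        if with_bait and not with_others:
            hits.append(clone)
    return hits


# Hypothetical retest data for three rescued AD-library plasmids.
retest = {
    "clone_A": {"PAK5": True, "unrelated_bait_1": False, "unrelated_bait_2": False},
    "clone_B": {"PAK5": True, "unrelated_bait_1": True, "unrelated_bait_2": False},
    "clone_C": {"PAK5": False, "unrelated_bait_1": False, "unrelated_bait_2": False},
}
print(specific_interactors(retest))  # ['clone_A']
```

Here clone_B would be set aside as a likely nonspecific activator, and clone_C as a clone that failed to reproduce on retest.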

Example 10: Mobility Shift DNA-Binding Assay Using Gel Electrophoresis 

A gel electrophoresis mobility shift assay can rapidly detect specific protein-DNA
interactions. Protocols are widely available in such manuals as Sambrook et al.
1989, Molecular cloning: a laboratory manual, second edition, Cold Spring Harbor Press,
Cold Spring Harbor, NY and Ausubel, F. M. et al. 1992, Short protocols in molecular
biology, Fourth edition, Greene and Wiley-Interscience, NY, each of which is incorporated
herein by reference in its entirety.

Probe DNA (<300 bp) is obtained from synthetic oligonucleotides,
restriction endonuclease fragments, or PCR fragments and end-labeled with 32P. An
aliquot of purified PAK5 (ca. 15 µg) or crude PAK5 extract (ca. 15 ng) is incubated at
constant temperature (in the range 22-37°C) for at least 30 minutes in 10-15 µl of buffer
(e.g., TAE or TBE, pH 8.0-8.5) containing radiolabeled probe DNA, nonspecific carrier
DNA (ca. 1 µg), BSA (300 µg/ml), and 10% (v/v) glycerol. The reaction mixture is then
loaded onto a polyacrylamide gel and run at 30-35 mA until good separation of free probe
DNA from protein-DNA complexes occurs. The gel is then dried and bands corresponding
to free DNA and protein-DNA complexes are detected by autoradiography.
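
Where the autoradiogram is quantified, one common way to summarize a lane is the fraction of probe found in the shifted band. The sketch below assumes background-subtracted band intensities in arbitrary units; the quantification step and the function name are illustrative additions and are not part of the protocol described above.

```python
def fraction_bound(complex_signal: float, free_signal: float) -> float:
    """Fraction of probe in the protein-DNA complex band of one lane.

    Both inputs are assumed to be background-subtracted band intensities
    in the same arbitrary units.
    """
    if complex_signal < 0 or free_signal < 0:
        raise ValueError("band intensities must be non-negative")
    total = complex_signal + free_signal
    if total == 0:
        raise ValueError("no probe signal detected in this lane")
    return complex_signal / total


# Hypothetical intensities for a single lane.
print(fraction_bound(300.0, 700.0))
```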

Some of the preferred embodiments of the invention described above are outlined
below and include, but are not limited to, the following embodiments. As those skilled in
the art will appreciate, numerous changes and modifications may be made to the preferred
embodiments of the invention without departing from the spirit of the invention. It is
intended that all such variations fall within the scope of the invention. The entire
disclosure of each publication cited herein is hereby incorporated by reference.

WHAT IS CLAIMED IS:

1. An isolated nucleic acid molecule comprising a nucleotide sequence selected from
the group consisting of:

a) SEQ ID NO:1, or a fragment thereof;

b) SEQ ID NO:2, or a fragment thereof;

c) a sequence complementary to at least a portion of SEQ ID NO:1 or SEQ ID
NO:2;

d) a sequence homologous to SEQ ID NO:1 or SEQ ID NO:2, or a fragment
thereof;

e) a sequence that encodes a polypeptide comprising SEQ ID NO:3, or a
fragment thereof; and

f) a sequence that encodes a polypeptide comprising an amino acid sequence
homologous to SEQ ID NO:3, or a fragment thereof.

2. The nucleic acid molecule of claim 1 wherein said nucleic acid molecule is DNA.

3. The nucleic acid molecule of claim 1 wherein said nucleic acid molecule is RNA. 

4. The nucleic acid molecule of claim 2 wherein said nucleotide sequence comprises
SEQ ID NO:2.

5. The nucleic acid molecule of claim 1 wherein said molecule is an antisense
oligonucleotide directed to SEQ ID NO:1 or SEQ ID NO:2.

6. The nucleic acid molecule of claim 5 wherein said oligonucleotide is directed to a
regulatory region of SEQ ID NO:1 or SEQ ID NO:2.

7. An expression vector comprising a nucleic acid molecule of claim 1. 

8. The expression vector of claim 7, wherein said nucleic acid molecule comprises
SEQ ID NO:2.



9. The vector of claim 7 wherein said vector is a plasmid.

10. The vector of claim 7 wherein said vector is a viral particle.

11. The vector of claim 10 wherein said vector is selected from the group consisting of
adenoviruses, parvoviruses, herpesviruses, poxviruses, adeno-associated viruses, Semliki
Forest viruses, vaccinia viruses, and retroviruses.

12. The vector of claim 7 wherein said nucleic acid molecule is operably linked to a
promoter selected from the group consisting of simian virus 40, mouse mammary tumor
virus, long terminal repeat of human immunodeficiency virus, Moloney virus,
cytomegalovirus immediate early promoter, Epstein Barr virus, Rous sarcoma virus, human
actin, human myosin, human hemoglobin, human muscle creatine, and human
metallothionein.

13. A host cell transformed with a vector of claim 7.

14. The transformed host cell of claim 13 wherein said cell is a bacterial cell. 

15. The transformed host cell of claim 14 wherein said bacterial cell is E. coli. 

16. The transformed host cell of claim 13 wherein said cell is yeast.

17. The transformed host cell of claim 16 wherein said yeast is S. cerevisiae.

18. The transformed host cell of claim 13 wherein said cell is an insect cell.

19. The transformed host cell of claim 18 wherein said insect cell is S. frugiperda. 

20. The transformed host cell of claim 13 wherein said cell is a mammalian cell. 

21. The transformed host cell of claim 20 wherein said mammalian cell is selected from the
group consisting of Chinese hamster ovary cells, HeLa cells, African green monkey kidney
cells, human 293 cells, and murine 3T3 fibroblasts.

22. A method of producing a polypeptide comprising SEQ ID NO:3, or a homolog or
fragment thereof, comprising the steps of:

a) introducing a recombinant expression vector of claim 7 into a compatible
host cell;

b) growing said host cell under conditions for expression of said polypeptide;
and

c) recovering said polypeptide from said host cell.

23. The method of claim 22 wherein said host cell is lysed and said polypeptide is
recovered from the lysate of said host cell. 

24. The method of claim 22 wherein said polypeptide is recovered by purifying the 
culture medium from said host cell without lysing said host cell. 

25. A composition comprising a nucleic acid molecule of claim 1 and an acceptable
carrier or diluent.

26. A composition comprising a recombinant expression vector of claim 7 and an
acceptable carrier or diluent.

27. An isolated polypeptide encoded by a nucleic acid molecule of claim 1.

28. The polypeptide of claim 27 wherein said polypeptide comprises SEQ ID NO:3.

29. The polypeptide of claim 27 wherein said polypeptide comprises an amino acid
sequence homologous to SEQ ID NO:3.

30. The polypeptide of claim 29 wherein said sequence homologous to SEQ ID NO:3
comprises at least one conservative amino acid substitution compared to SEQ ID NO:3.

31. The polypeptide of claim 27 wherein said polypeptide comprises a fragment of SEQ
ID NO:3.

32. A composition comprising a polypeptide of claim 27 and an acceptable carrier or
diluent.

33. An isolated antibody which binds to an epitope on a polypeptide of claim 27.

34. The antibody of claim 33 wherein said antibody is a monoclonal antibody.

35. A composition comprising an antibody of claim 33 and an acceptable carrier or
diluent.

36. A kit comprising an antibody which binds to a polypeptide of claim 27 and a
negative control antibody.

37. The kit of claim 36 further comprising an additional kit component.

38. The kit of claim 37 wherein said additional kit component comprises instructions.

39. A method of inducing an immune response in a mammal against a polypeptide of
claim 27 comprising administering to said mammal an amount of said polypeptide
sufficient to induce said immune response.

40. A method for identifying a compound which binds PAK5 comprising the steps of:

a) contacting PAK5 with a compound; and

b) determining whether said compound binds PAK5.

41. The method of claim 40 wherein binding of said compound to PAK5 is
determined by a protein binding assay.

42. The method of claim 41 wherein said protein binding assay is selected from
the group consisting of a gel-shift assay, Western blot, and radiolabeled competition assay.



43. A method for identifying a compound which binds a nucleic acid molecule
encoding PAK5 comprising the steps of:

a) contacting said nucleic acid molecule encoding PAK5 with a
compound; and

b) determining whether said compound binds said nucleic acid
molecule.

44. The method of claim 43 wherein binding is determined by a gel-shift assay. 

45. A method for identifying a compound which modulates the activity of
PAK5 comprising the steps of:

a) contacting PAK5 with a compound; and

b) determining whether PAK5 activity has been modulated.

46. The method of claim 45 wherein said activity is serine/threonine kinase
activity.

47. The method of claim 45 wherein said activity is a PAK5 dependent cell
biological response.

48. A compound identified by the method of claim 40.

49. A compound identified by the method of claim 43. 

50. A compound identified by the method of claim 45.

Figure 1A

Figure 1B

Figure 2A

SEQUENCE LISTING

<110> Pharmacia & Upjohn S.p.A.

<120> A new member of the PAK protein family, nucleic acids
and methods related to the same

<130> 00019

<140>
<141>

<150> US 09/439756
<151> 1999-11-15

<160> 14

<170> PatentIn Ver. 2.1

<210> 1 

<211> 2511 

<212> DNA 

<213> Homo sapiens 

<400> 1 

gagaccggga acatggcgct gggagcnctg tagcagctga gaaggggctg aggcaccgcc 60 
gcttcgctga cagccggcca ccagatgttc atgcattcta gagaaagtgg aaaacttaga 120 
agcctaatta atgactgtct tctggacctc tgagaccatg tttctagtgt tttccgtgga 180 
atattatcag aaatacactg tggtgaaatg cttccacctc ttgctaaaat gaacactgag 240 
gaaaaatgaa gaagactgac aagcaccagc gaaaagttgc agaatagaaa tagccacact 300 
cctctggagt ctttaattca tccacagcca tcatataaag gttttggcat catgtttggg 360 
aagaaaaaga aaaagattga aatatctggc ccgtccaact ttgaacacag ggttcatact 420
gggtttgatc cacaagagca gaagtttacc ggccttcccc agcagtggca cagcctgtta 480
gcagatacgg ccaacaggcc aaagcctatg gtggaccctt catgcatcac acccatccag 540
ctggctccta tgaagacaat cgttagagga aacaaaccct gcaaggaaac ctccatcaac 600


ggcctgctag 

13 13 13 ^^13 


aggattttga 


caacatctcg 


gtgactcget 


ccaactccct 


aaggaaagaa 


660 


agcccaccca 


ccccagatca 


aacracrcctcc 

1313 13 s -* 13 «- 


acrccaccrcrtc 


caacfccaccrc 


oaaacraaaa t 


720 


ggcttcatca 


ccttctccca 


gtattccagc 


gaatccgata 


ctactcrctaa 


ctacaccracc 


780 


gaaaagtaca 


crGGacraacracr 

I31313 , -*l3 , -*'-*13*-*=3 


tctctatgga 


crataatctoa 


atccotatta 


tacraQQcaQc 


840 


cacgcagcca 


agcaaaa tgg 


crcaccrtaatcr 


aaaa t gaagc 


ac actCToaaQc 


ctactattct 


900 


cracicrtcraacrc 


c 1 1 toaaa t c 


ccrattttocc 


aoattttctcr 


c pcra 1 1 a t* p a 


p t* papa t* 1 1 rr 


960 


cractcactcra 


ocaaaccaaa 


taaatacaot 


aacctcaacrt 

^3«-*v— -v^ UWHUH 


craera p/t a t pa 


erapapppt - per 

y ciy ciy v_v_ toy 


1020 


acrtacrctccc 


ctctacratta 


t teat tccaa 


ttcacacctt 


c t acraap t cr p 


a nppa p pa pp 


1080 


crcrcr t ac t c ca 


acrcr aaacrcc t 


oaccrtacacrt 


craaant craa h 

yaaay uy caCa. 




ppt ppa t pa p 




ta t cracacicra 


crcrccaaaatc 


ttcatapptcr 








1 ?nn 


cslcislgg t c ca 


aatcaCTcrctc 


crcrcrac t ceacr 


craa ppcra t era 


t" nppa 1 1* trsci 


ciy ciciy u.y v^ci 




tt taaa_accc 




a pa p t~p p t" a p 


aapf*pptapa 




t~ t~ rr+~ r^r^t^izi rr 




cccacaatat 


ocattccaaa. 


crcr t ocra t~ ta o 


era t* pnanpa p 




x_-Ciy ^ — v^V~. L>v^> k^CL 


X J O \J 


c t g t caocrcrt 


c t aacac c ta 




pptcrppaaap 




a. ciciy Vw clclci 


144D 


tCCTOGCtatt 


cc t caacrcarr 


tcaccacrtac 


p pert" p t p/erert 




^uL>V« L» I— y LuU 




catcacccct 


ppp tp/eaoap/ 


papt t pppap 

v-ciy i— V—- y * — o.y 


f- a r» a t - p t p pa 


^yy^ ^ ^ *-ci 


i~ nzz rrr* t~ ^ 
uyayuLLu 


1 SfiO 

IJOu 


ctcagcctct 


catccacrcac 


ctaccccrcccr 




CTPtCCtCPtP 


^y civ»,^ciy ^_ciy 


1620 


ccctccaaaa 


tatcccataa 


acacrt t tccra 


opaoccctCTP 


aactoatcrcrt 


^ayvjoL ciy y ci 


1680 

1UUU 


gac c c caggg 


aatacttggc 


caactttatc 


aaaa t c acrocr 
m^mi^ «^>>^ yyyy 


aacrac t caac 


CQCTcatccrta 


1740 


tgcatcggca 


ccgagaaaca 


cacacrcraaaa 


caaa 1 1 ocaa 


t aaaaaaaa t 


aaac c t ccaa 


1800 


aagcaacaga 


gacgagaact 


gcttttcaat 


aaaatcataa 


tea t Qccraaa 

w ■w'w* w 13 13 13 


ttac caeca t 


1860 


gacaatgtgg 

-3 13 1313 


ttgacatgta 


cagcagctac 


cttatcacrccF 

^ ^-is ^yy^y 


atgagctctg 


aataatcata 

yy ^yy i_y 


1920 


gagtttctag 


aaggtggtgc 

13 13 13 13 -3 


cttgacagac 


attataactc 


acaccagaat 


gaa t gaagaa 


1980 


cagatagcta ctgtctgcct gtcagttctg agagctctct cctaccttca taaccaagga 2040
gtgattcaca gggacataaa aagtgactcc atcctcctga caagcgatgg ccggataaag 2100
ttgtctgatt ttggtttctg tgctcaagtt tccaaagagg tgccgaagag gaaatcattg 2160
gttggcactc cctactggat ggcccctgag ctgatttcta ggctacctta tgggacagag 2220
gtggacatct ggtccctcgg gatcatggtg atagaaatga ttgatggcga gcccccctac 2280
ttcaatgagc ctcccctcca ggcgatgcgg aggatccggg acagtttacc tccaagagtg 2340
aaggacctac acaaggtttc ttcagtgctc cggggattcc tagacttgat gttggtgagg 2400
gagccctctc agagagcaac agcccaggaa ctcctcggac atccattctt aaaactagca 2460
ggtccaccgt cttgcatcgt ccccctcatg agacaataca ggcatcactg a 2511

<210> 2
<211> 2157
<212> DNA
<213> Homo sapiens

<400> 2






atgtttggga agaaaaagaa aaagattgaa atatctggcc cgtccaactt tgaacacagg 60
gttcatactg ggtttgatcc acaagagcag aagtttaccg gccttcccca gcagtggcac 120
agcctgttag cagatacggc caacaggcca aagcctatgg tggacccttc atgcatcaca 180
cccatccagc tggctcctat gaagacaatc gttagaggaa acaaaccctg caaggaaacc 240
tccatcaacg gcctgctaga ggattttgac aacatctcgg tgactcgctc caactcccta 300
aggaaagaaa gcccacccac cccagatcag ggagcctcca gccacggtcc aggccacgcg 360
gaagaaaatg gcttcatcac cttctcccag tattccagcg aatccgatac tactgctgac 420
tacacgaccg aaaagtacag ggagaagagt ctctatggag atgatctgga tccgtattat 480
agaggcagcc acgcagccaa gcaaaatggg cacgtaatga aaatgaagca cggggaggcc 540
tactattctg aggtgaagcc tttgaaatcc gattttgcca gattttctgc cgattatcac 600
tcacatttgg actcactgag caaaccaagt gaatacagtg acctcaagtg ggagtatcag 660
agagcctcga gtagctcccc tctggattat tcattccaat tcacaccttc tagaactgca 720
gggaccagcg ggtgctccaa ggagagcctg gcgtacagtg aaagtgaatg gggacccagc 780
ctggatgact atgacaggag gccaaagtct tcgtacctga atcagacaag ccctcagccc 840
accatgcggc agaggtccag gtcaggctcg ggactccagg aaccgatgat gccatttgga 900
gcaagtgcat ttaaaaccca tccccaagga cactcctaca actcctacac ctaccctcgc 960
ttgtccgagc ccacaatgtg cattccaaag gtggattacg atcgagcaca gatggtcctc 1020
agccctccac tgtcagggtc tgacacctac cccaggggcc ctgccaaact acctcaaagt 1080
caaagcaaat cgggctattc ctcaagcagt caccagtacc cgtctgggta ccacaaagcc 1140
accttgtacc atcacccctc cctgcagagc agttcgcagt acatctccac ggcttcctac 1200
ctgagctccc tcagcctctc atccagcacc tacccgccgc ccagctgggg ctcctcctcc 1260
gaccagcagc cctccagggt gtcccatgaa cagtttcggg cggccctgca gctggtggtc 1320
agcccaggag accccaggga atacttggcc aactttatca aaatcgggga aggctcaacc 1380
ggcatcgtat gcatcggcac cgagaaacac acagggaaac aagttgcagt gaagaaaatg 1440
gacctccgga agcaacagag acgagaactg cttttcaatg aggtcgtgat catgcgggat 1500
taccaccatg acaatgtggt tgacatgtac agcagctacc ttgtcggcga tgagctctgg 1560
gtggtcatgg agtttctaga aggtggtgcc ttgacagaca ttgtgactca caccagaatg 1620
aatgaagaac agatagctac tgtctgcctg tcagttctga gagctctctc ctaccttcat 1680
aaccaaggag tgattcacag ggacataaaa agtgactcca tcctcctgac aagcgatggc 1740
cggataaagt tgtctgattt tggtttctgt gctcaagttt ccaaagaggt gccgaagagg 1800
aaatcattgg ttggcactcc ctactggatg gcccctgagc tgatttctag gctaccttat 1860
gggacagagg tggacatctg gtccctcggg atcatggtga tagaaatgat tgatggcgag 1920
cccccctact tcaatgagcc tcccctccag gcgatgcgga ggatccggga cagtttacct 1980
ccaagagtga aggacctaca caaggtttct tcagtgctcc ggggattcct agacttgatg 2040
ttggtgaggg agccctctca gagagcaaca gcccaggaac tcctcggaca tccattctta 2100
aaactagcag gtccaccgtc ttgcatcgtc cccctcatga gacaatacag gcatcac 2157



<210> 3 
<211> 719 
<212> PRT 

<213> Homo sapiens 
<400> 3 

Met Phe Gly Lys Lys Lys Lys Lys Ile Glu Ile Ser Gly Pro Ser Asn
1 5 10 15

Phe Glu His Arg Val His Thr Gly Phe Asp Pro Gln Glu Gln Lys Phe
20 25 30

Thr Gly Leu Pro Gln Gln Trp His Ser Leu Leu Ala Asp Thr Ala Asn
35 40 45

Arg Pro Lys Pro Met Val Asp Pro Ser Cys Ile Thr Pro Ile Gln Leu
50 55 60

Ala Pro Met Lys Thr Ile Val Arg Gly Asn Lys Pro Cys Lys Glu Thr
65 70 75 80

Ser Ile Asn Gly Leu Leu Glu Asp Phe Asp Asn Ile Ser Val Thr Arg
85 90 95

Ser Asn Ser Leu Arg Lys Glu Ser Pro Pro Thr Pro Asp Gln Gly Ala
100 105 110

Ser Ser His Gly Pro Gly His Ala Glu Glu Asn Gly Phe Ile Thr Phe
115 120 125

Ser Gln Tyr Ser Ser Glu Ser Asp Thr Thr Ala Asp Tyr Thr Thr Glu
130 135 140

Lys Tyr Arg Glu Lys Ser Leu Tyr Gly Asp Asp Leu Asp Pro Tyr Tyr
145 150 155 160

Arg Gly Ser His Ala Ala Lys Gln Asn Gly His Val Met Lys Met Lys
165 170 175

His Gly Glu Ala Tyr Tyr Ser Glu Val Lys Pro Leu Lys Ser Asp Phe
180 185 190

Ala Arg Phe Ser Ala Asp Tyr His Ser His Leu Asp Ser Leu Ser Lys
195 200 205

Pro Ser Glu Tyr Ser Asp Leu Lys Trp Glu Tyr Gln Arg Ala Ser Ser
210 215 220

Ser Ser Pro Leu Asp Tyr Ser Phe Gln Phe Thr Pro Ser Arg Thr Ala
225 230 235 240

Gly Thr Ser Gly Cys Ser Lys Glu Ser Leu Ala Tyr Ser Glu Ser Glu
245 250 255

Trp Gly Pro Ser Leu Asp Asp Tyr Asp Arg Arg Pro Lys Ser Ser Tyr
260 265 270

Leu Asn Gln Thr Ser Pro Gln Pro Thr Met Arg Gln Arg Ser Arg Ser
275 280 285

Gly Ser Gly Leu Gln Glu Pro Met Met Pro Phe Gly Ala Ser Ala Phe
290 295 300

Lys Thr His Pro Gln Gly His Ser Tyr Asn Ser Tyr Thr Tyr Pro Arg
305 310 315 320

Leu Ser Glu Pro Thr Met Cys Ile Pro Lys Val Asp Tyr Asp Arg Ala
325 330 335

Gln Met Val Leu Ser Pro Pro Leu Ser Gly Ser Asp Thr Tyr Pro Arg
340 345 350

Gly Pro Ala Lys Leu Pro Gln Ser Gln Ser Lys Ser Gly Tyr Ser Ser
355 360 365

Ser Ser His Gln Tyr Pro Ser Gly Tyr His Lys Ala Thr Leu Tyr His
370 375 380

His Pro Ser Leu Gln Ser Ser Ser Gln Tyr Ile Ser Thr Ala Ser Tyr
385 390 395 400

Leu Ser Ser Leu Ser Leu Ser Ser Ser Thr Tyr Pro Pro Pro Ser Trp
405 410 415

Gly Ser Ser Ser Asp Gln Gln Pro Ser Arg Val Ser His Glu Gln Phe
420 425 430

Arg Ala Ala Leu Gln Leu Val Val Ser Pro Gly Asp Pro Arg Glu Tyr
435 440 445

Leu Ala Asn Phe Ile Lys Ile Gly Glu Gly Ser Thr Gly Ile Val Cys
450 455 460

Ile Gly Thr Glu Lys His Thr Gly Lys Gln Val Ala Val Lys Lys Met
465 470 475 480

Asp Leu Arg Lys Gln Gln Arg Arg Glu Leu Leu Phe Asn Glu Val Val
485 490 495

Ile Met Arg Asp Tyr His His Asp Asn Val Val Asp Met Tyr Ser Ser
500 505 510

Tyr Leu Val Gly Asp Glu Leu Trp Val Val Met Glu Phe Leu Glu Gly
515 520 525

Gly Ala Leu Thr Asp Ile Val Thr His Thr Arg Met Asn Glu Glu Gln
530 535 540

Ile Ala Thr Val Cys Leu Ser Val Leu Arg Ala Leu Ser Tyr Leu His
545 550 555 560

Asn Gln Gly Val Ile His Arg Asp Ile Lys Ser Asp Ser Ile Leu Leu
565 570 575

Thr Ser Asp Gly Arg Ile Lys Leu Ser Asp Phe Gly Phe Cys Ala Gln
580 585 590

Val Ser Lys Glu Val Pro Lys Arg Lys Ser Leu Val Gly Thr Pro Tyr
595 600 605

Trp Met Ala Pro Glu Leu Ile Ser Arg Leu Pro Tyr Gly Thr Glu Val
610 615 620

Asp Ile Trp Ser Leu Gly Ile Met Val Ile Glu Met Ile Asp Gly Glu
625 630 635 640

Pro Pro Tyr Phe Asn Glu Pro Pro Leu Gln Ala Met Arg Arg Ile Arg
645 650 655

Asp Ser Leu Pro Pro Arg Val Lys Asp Leu His Lys Val Ser Ser Val
660 665 670

Leu Arg Gly Phe Leu Asp Leu Met Leu Val Arg Glu Pro Ser Gln Arg
675 680 685

Ala Thr Ala Gln Glu Leu Leu Gly His Pro Phe Leu Lys Leu Ala Gly
690 695 700

Pro Pro Ser Cys Ile Val Pro Leu Met Arg Gln Tyr Arg His His
705 710 715

<210> 4
<211> 1638
<212> DNA
<213> Homo sapiens

<400> 4






a tcrt cslslsl ta 


acooc c t aoa 


cattcaaaac 


aocactatcra 


1 1 crcracrc paa 


pacrpaaaaat 


p t cr p p t" p pa a 


apppacracrna 


y ciuy uuuuu^ 


cr cracra t aaaa 




craaacracraaa 


t tt craacaca 


caat tea tort 


pcrcrttttcrat 

vyy ^ i_y<_i l» 


aaacaotcraa 

y t«*-y ^^y ^333 


ccccrcttcrct 


tcacracatca 


ppcrpacrcrPtcr 


t~ tetcrcrat"crt 

*— »— v— *-y y c* v-y v- 


crt t crcracrt tt 


aaa t aca tcra 


crp 1 1 1 a p acra 


taacrtpacrpt 


crt"craacrcrctcr 

y ^au^^u *"-y 


t" cr t p t era era p 


t pp t Cfpacrt cr 

v— i-.y \^uy i — y 


aa t era taa t a 


ctaccccacc 


aecacrtcratt 


fcacacaccrcrt 


c t cr t oa t tcra 


accacttcct 


cccatttcac 


ctactgaaaa 


t aacaccac t 


aagcagaaga 


agaagectaa 


aatgtctgat 


qtgaQtqtqcr 


gcgatcctaa 


gaagaaatat 


tcaggcaccg 


tgtacacagc 


aatggatgtg 


atgaacctcc 


agcagcagc c 


caagaaagag 


gaaaacaaga 


acccaaacat 


tgtgaattac 


tgggttgtta 


tggaatactt 


ggctggaggc 


atggatgaag 


gecaaattge 


agctgtgtgc 


cattcgaacc 


aggtcattca 


cagagacatc 


ggctctgtca 


agctaactga 


ctttggattc 


cggagcacca 


tggtaggaac 


cccatactgg 


tatgggecca 


aggttgacat 


ctggtccctg 


gagcctccat 


acctcaatga 


aaaccctctg 


accccagaac 


ttcagaaccc 


agagaagctg 



aaacccccag cccctccgat gagaaatacc 60
gctggaaccc taaaccatgg ttctaaacct 120
aaggaccgat tttaccgatc cattttacct 180
gagcggccag agatttctct cccttcagat 240
gctgtcacag gggagtttac cggaatgcca 300
aatatcacta agtcggagca gaagaaaaac 360
tacaactcga agaagacatc caacagccag 420
gaggattaca attcttctaa tgccttgaat 480
ccaccagttt cagaagatga ggatgatgat 540
gctccacgcc cagagcacac aaaatctgta 600
gtcactccaa ctcgggacgt ggctacatct 660
ccaccagatg ctttgaccct taatactgag 720
gaggagatct tggagaaatt acgaagcata 780
acacggtttg agaagattgg acaaggtgct 840
gccacaggac aggaggtggc cattaagcag 900
ctgattatta atgagatcct ggtcatgagg 960
ttggacagtt acctcgtggg agatgagctg 1020
tccttgacag atgtggtgac agaaacttgc 1080
cgtgagtgtc tgcaggctct ggagtctttg 1140
aagagtgaca atattctgtt gggaatggat 1200
tgtgcacaga taaccccaga gcagagcaaa 1260
atggcaccag aggttgtgac acgaaaggcc 1320
ggcatcatgg ccatcgaaat gattgaaggg 1380
agagccttgt acctcattgc caccaatggg 1440
tcagctatct tccgggactt tctgaaccgc 1500
tgtctcgaga tggatgtgga gaagagaggt tcagctaaag agctgctaca gcatcaattc 1560
ctgaagattg ccaagcccct ctccagcctc actccactga ttgctgcagc taaggaggca 1620
acaaagaaca atcactaa 1638



<210> 5 
<211> 1578 
<212> DNA 

<213> Homo sapiens 
<400> 5 

atgtctgata acggagaact ggaagataag cctccagcac ctcctgtgcg aatgagcagc 60 
accatcttta gcactggagg caaagaccct ttgtcagcca atcacagttt gaaacctttg 120 
ccctctgttc cagaagagaa aaagcccagg cataaaatca tctccatatt ctcaggcaca 180 
gagaaaggaa gtaaaaagaa agaaaaggaa cggccagaaa tttctcctcc atctgatttt 240 
gagcacacca tccatgttgg ctttgatgct gttactggag aattcactgg catgccagaa 300 
cagtgggctc gattactaca gacctccaat atcaccaaac tagagcaaaa gaagaatcct 360 
caggctgtgc tggatgtcct aaagttctac gactccaaca cagtgaagca gaaatatctg 420 
agctttactc ctcctgagaa agatggcctt ccttctggaa cgccagcact gaatgccaag 480 
ggaacagaag cacccgcagt agtgacagag gaggaggatg atgatgaaga gactgctcct 540 
cccgttattg ccccgcgacc ggatcatacg aaatcaattt acacacggtc tgtaattgac 600 
cctgttcctg caccagttgg tgattcacat gttgatggtg ctgccaagtc tttagacaaa 660 
cagaaaaaga agcctaagat gacagatgaa gagattatgg agaaattaag aactatcgtg 720 
agcataggtg accctaagaa aaaatataca agatatgaaa aaattggaca aggggcttct 780 
ggtacagttt tcactgctac tgacgttgca ctgggacagg aggttgctat caaacaaatt 840 
aatttacaga aacagccaaa gaaggaactg atcattaacg agattctggt gatgaaagaa 900 
ttgaaaaatc ccaacatcgt taactttttg gacagttacc tggtaggaga tgaattgttt 960 
gtggtcatgg aataccttgc tggggggtca ctcactgatg tggtaacaga aacagcttgc 1020 
atggatgaag cacagattgc tgctgtatgc agagagtgtt tacaggcatt ggagttttta 1080 
catgctaatc aagtgatcca cagagacatc aaaagtgaca atgtactttt gggaatggaa 1140 
ggatctgtta agctcactga ctttggtttc tgtgcccaga tcacccctga gcagagcaaa 1200 
cgcagtacca tggtcggaac gccatactgg atggcaccag aggtggttac acggaaagct 1260
tatggcccta aagtcgacat atggtctctg ggtatcatgg ctattgagat ggtagaagga 1320
gagcctccat acctcaatga aaatcccttg agggccttgt acctaatagc aactaatgga 1380
accccagaac ttcagaatcc agagaaactt tccccaatat ttcgggattt cttaaatcga 1440
tgtttggaaa tggatgtgga aaaaaggggt tcagccaaag aattattaca gcatcctttc 1500
ctgaaactgg ccaaaccgtt atctagcttg acaccactga tcatggcagc taaagaagca 1560
atgaagagta accgttaa 1578



<210> 6 

<211> 1635 

<212> DNA 

<213> Homo sapiens 

<400> 6 

atgtctgacg gtctggataa tgaagagaaa cccccggctc ctccactgag gatgaatagt 60
aacaaccggg attcttcagc actcaaccac agctccaaac cacttcccat ggcccctgaa 120
gagaagaata agaaagccag gcttcgctct atcttcccag gaggagggga taaaaccaat 180
aagaagaagg agaaagagcg cccagagatc tctcttcctt cagactttga gcatacgatt 240
catgtggggt ttgatgcagt caccggggaa ttcactggaa ttccagagca atgggcacga 300
ttactccaaa cttccaacat aacaaaattg gaacagaaga agaacccaca agctgttcta 360
gatgttctca aattctatga ttccaaagaa acagtcaaca accagaaata catgagcttt 420
acatcaggag ataaaagtgc acatggatac atagcagccc atccttcgag tacaaaaaca 480
gcatctgagc ctccattggc ccctcctgtg tctgaagaag aagatgaaga ggaagaagaa 540
gaagaagatg aaaatgagcc accaccagtt atcgcaccaa gaccagagca tacaaaatca 600
atctatactc gttctgtggt tgaatccatt gcttcaccag cagtaccaaa taaagaggtc 660
acaccaccct ctgctgaaaa tgccaattcc agtactttgt acaggaacac agatcggcaa 720
agaaaaaaat ccaagatgac agatgaggag atcttagaga agctaagaag cattgtgagt 780
gttggggacc caaagaaaaa atacacaaga tttgaaaaaa ttggtcaagg ggcatcaggt 840
actgtttata cagcactaga cattgcaaca ggacaagagg tggccataaa gcagatgaac 900
cttcaacagc aacccaagaa ggaattaatt attaatgaaa ttctggtcat gagggaaaat 960
aagaacccta atattgttaa ttatttagat agctacttgg tgggtgatga actatgggta 1020
gtcatggaat acttggctgg tggctctctg actgatgtgg tcacagagac ctgtatggat 1080
gaaggacaga tagcagctgt ctgcagagag tgcctgcaag ctttggattt cctgcactca 1140
aaccaggtga tccatagaga tataaagagt gacaatattc ttctcgggat ggatggctct 1200
gttaaattga ctgactttgg gttctgtgcc cagatcactc ctgagcaaag taaacgaagc 1260
actatggtgg gaaccccata ttggatggca cctgaggtgg tgactcgaaa agcttatggt 1320
ccgaaagttg atatctggtc ccttggaatt atggcaattg aaatggtgga aggtgaaccc 1380
ccttacctta atgaaaatcc actcagggca ttgtatctga tagccactaa tggaactcca 1440
gagctccaga atcctgagag actgtcagct gtattccgtg actttttaaa tcgctgtctt 1500
gagatggatg tggataggcg aggatctgcc aaggagcttt tgcagcatcc atttttaaaa 1560
ttagccaagc ctctctccag cctgactcct ctgattatcg ctgcaaagga agcaattaag 1620
aacagcagcc gctaa 1635



<210> 7 

<211> 1776 

<212> DNA 

<213> Homo sapiens 



<400> 7 






atgtttggga agaggaagaa gcgggtggag atctccgcgc cgtccaactt cgagcaccgc 60
gtgcacacgg gcttcgacca gcacgagcag aagttcacgg ggctgccccg ccagtggcag 120
agcctgatcg aggagtcggc tcgccggccc aagcccctcg tcgaccccgc ctgcatcacc 180
tccatccagc ccggggcccc caagaccatc gtgcggggca gcaaaggtgc caaagatggg 240
gccctcacgc tgctgctgga cgagtttgag aacatgtcgg tgacacgctc caactccctg 300
cggagagaca gcccgccgcc gcccgcccgt gcccgccagg aaaatgggat gccagaggag 360
ccggccacca cggccagagg gggcccaggg aaggcaggca gccgaggccg gttcgccggt 420
cacagcgagg caggtggcgg cagtggtgac aggcgacggg cggggccaga gaagaggccc 480
aagtcttcca gggagggctc agggggtccc caggagtcct cccgggacaa acgccccctc 540
tccgggcctg atgtcggcac cccccagcct gctggtctgg ccagtggggc gaaactggca 600
gctggccggc cctttaacac ctacccgagg gctgacacgg accacccatc ccggggtgcc 660
cagggggagc ctcatgacgt ggcccctaac gggccatcag cggggggcct ggccatcccc 720
cagtcctcct cctcctcctc ccggcctccc acccgagccc gaggtgcccc cagccctgga 780
gtgctgggac cccacgcctc agagccccag ctggcccctc cagcctgcac ccccgccgcc 840
cctgctgttc ctgggccccc tggcccccgc tcaccacagc gggagccaca gcgagtatcc 900
[illegible] gtggtggacc caggcgaccc ccgctcctac 960
[illegible] tccacgggca tcgtgtgcat cgccaccgtg 1020
[illegible] aagatggacc tgcgcaagca gcagaggcgc 1080
[illegible] agggactacc agcacgagaa tgtggtggag 1140
[illegible] ctctgggtgg tcatggagtt cctggaagga 1200
[illegible] aggatgaacg aggagcagat cgcagccgtg 1260
[illegible] ctccacgccc agggcgtcat ccaccgggac 1320
[illegible] gatggcaggg tgaagctgtc agactttggg 1380
[illegible] cgaaggaagt cgctggtcgg cacgccctac 1440
tggatggccc cagagctcat ctcccgcctt ccctacgggc cagaggtaga catctggtcg 1500
ctggggataa tggtgattga gatggtggac ggagagcccc cctacttcaa cgagccaccc 1560
ctcaaagcca tgaagatgat tcgggacaac ctgccacccc gactgaagaa cctgcacaag 1620
gtgtcgccat ccctgaaggg cttcctggac cgcctgctgg tgcgagaccc tgcccagcgg 1680
gccacggcag ccgagctgct gaagcaccca ttcctggcca aggcagggcc gcctgccagc 1740
atcgtgcccc tcatgcgcca gaaccgcacc agatga 1776



<210> 8 
<211> 581 
<212> DNA 



<213> Homo sapiens




<400> 8 






gagaccggga acatggcgct gggagcnctg tagcagctga gaaggggctg aggcaccgcc 60
gcttcgctga cagccggcca ccagatgttc atgcattcta gagaaagtgg aaaacttaga 120
agcctaatta atgactgtct tctggacctc tgagaccatg tttctagtgt tttccgtgga 180
atattatcag aaatacactg tggtgaaatg cttccacctc ttgctaaaat gaacactgag 240
gaaaaatgaa gaagactgac aagcaccagc gaaaagttgc agaatagaaa tagccacact 300
cctctggagt ctttaattca tccacagcca tcatataaag gttttggcat catgtttggg 360
aagaaaaaga aaaagattga aatatctggc ccgtccaact ttgaacacag ggttcatact 420
gggtttgatc cacaagagca gaagtttacc ggccttcccc agcagtggca cagcctgtta 480
gcagatacgg ccaacaggcc aaagcctatg gtggaccctt catgcatcac acccatccag 540
ctggctccta tgaagacatc gttagaggaa acaaaccctg c 581

<210> 9
<211> 20
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 9

gcatcatgtt tgggaagaaa 20

<210> 10
<211> 20
<212> DNA

<213> Artificial Sequence
<220>

<223> Description of Artificial Sequence: primer
<400> 10

asctcwggkg ccatccarta 20



<210> 11 



<211> 1846 



<212> DNA 



<213> Homo sapiens 
<400> 11



gcatcatgtt tgggaagaaa aagaaaaaga ttgaaatatc tggcccgtcc aactttgaac 60
acagggttca tactgggttt gatccacaag agcagaagtt taccggcctt ccccagcagt 120


ggcacagcct gttagcagat acggccaaca ggccaaagcc tatggtggac ccttcatgca 180


tcacacccat 


c cacrc t acre t 


cc tat aaaaa 


caatcattaa 


aaaaaacaaa 


ccc tocaaaa 


240 


aaacctccat 


c aaccoc eta 


c t aaaocra 1 1 


t taacaaca t 


ctcaataact 


cap tccaar t 


300 

J WW 


ccc taaaaaa 


aaaaacrccca 


CCCPiCCCCPiO 




cc Piacc pic 


yy uv^v*ayy^v* 


360 


accrc aaaaaa 


aaa t oac 1 1 c 


atcaccttct 


cccaotattc 


caocoaa tec 


aatapt art*o 

yauuw ^a\a> t*y 


420 


P t~ Clrl C t" A PA P 


ry^rpCrA A A AP 
y ci^wV— .y aaaay 


fflna oopa pa 


Pi PAPt" P t P t" A 
ciy ciy L-v^ l-ci 


1~ nnfl np\ t~ t* 

Lyyayauya l. 


P t" PvrrA t PPPt* 
l» LyyaL^L.y l 


lOw 


a l» La L-Ciy ciy y 


cpicrccpi ccicpi 
Lay ^,v - .(C*\w.y Ld 


y l l cicty i_ add 

i 


ci *-.y yy v-ci^y l. 


caa. Ly aaaa L.y 


aayuauyyyy 


RAD 

L/*±vJ 


ay y^^* Luu uci 


t~ 1~ p t pa nrrh rr 
l tyayy Ly 


a a rrr p 1~ t t~pA 

uay u. i_y a 




LyLLaya ulu 


ll> LyLLya l l 




Cl LLdL LLdLd 


l l uyyaL LLa 


^- i_.yciy wdciciv^ 


v^ciciy Lyaci La 


Lay Lyat-L. ll 


day Ly y y ay l 


DDU 


p\ t~ c*p\ rr^ rr^ rrr 


l ^^y a y Lay*- 




Cl U La L L L 


LLaaLLLata 


rr'h t~rt* a/*tj^ 
LL. L LL- Layaa 


790 


r t* nr ^ nnrrrfl r* 
l Ly Ldyy ydL 


Ldy Lyy y LyL 


ULuaayyaya 


yLL uyy L.y La 


Lay Lyaaay L 


y aa LyyyydL 


/ ou 


npa rir*C V C\CtP\ 

LLdy ll uyya 


uyaL. La LyaL 


ciy y ciy y ^ cia. . 


Cty L.L. L. LLy L-Cl 


r» r t* cip\ f~ r* rr 
LL Ly aa LLay 


aLaayLLL LL 


o^± u 


uyv- ^ i^aL v^a o 


y *-y y ^* a y »y y 


tLL«yy LLay 


p;p t* rnrrrra p t~ 
yv LLyyyaL l 


LLayyaaLLy 


d Ly d Ly LLd l 


900 




L,y La L l- Laaa 




P\ P\ C\C\P\ cpkcY c 


L. LOLuuL LLL 


i~ ^ r* p^ rr t* c p 

laLaLL LaLL 




p t" ccsc t~ t" r;t p 


ppa cicc c Pi p a 

v— y ay ^v-^av-a 


^ 1" p/t rrra 1 1* p 


caaayy oyya 


t" 1" Pi ceiPi t" rna 


yua^uya uyy 


1020 

J. U4. W 


tcctcaaccc 




y y y uya^a. 


pptappppAor 


rrrrrrrrrforr 
yy y ^ uy^v^ 


aaart 1 APPt"P 


1080 


aaaa t c aaaa 


caaa t c aaac 


tattcctcaa 


acaatcacca 


atacccotc t 


y y y ^-c*v«^c*v*a 


1140 


aagccacctt 


ataccatcac 


ccctccctac 


aoaacaat t c 


acaatacatc 


t c c acaac 1 1 


1200 


cctacctgag 


ctccctcagc 


c tc tcatcca 


gcacctaccc 


gc cgee cage 


tggggc t cc t 


1260 


cctccgacca 


gcagccctcc 


aacratatccc 


atgaacagtt 


t caaacaac c 


ctacaactaa 


1320 


tggtcagccc aggagacccc agggaatact tggccaactt tatcaaaatc ggggaaggct 1380
caaccggcat cgtatgcatc ggcaccgaga aacacacagg gaaacaagtt gcagtgaaga 1440
aaatggacct ccggaagcaa cagagacgag aactgctttt caatgaggtc gtgatcatgc 1500
gggattacca ccatgacaat gtggttgaca tgtacagcag ctaccttgtc ggcgatgagc 1560
tctgggtggt catggagttt ctagaaggtg gtgccttgac agacattgtg actcacacca 1620
gaatgaatga agaacagata gctactgtct gcctgtcagt tctgagagct ctctcctacc 1680
ttcataacca aggagtgatt cacagggaca taaaaagtga ctccatcctc ctgacaagcg 1740
atggccggat aaagttgtct gattttggtt tctgtgctca agtttccaaa gaggtgccga 1800
agaggaaatc attggttggc actccctact ggatggcccc tgagct 1846

<210> 12 

<211> 417 

<212> DNA 

<213> Homo sapiens 

<400> 12 

ataaagttgt ctgattttgg tttctgtgct caagtttcca aagaggtgcc gaagaggaaa 60 
tcattggttg gcactcccta ctggatggcc cctgaggtga tttctaggct accttatggg 120 
acagaggtgg acatctggtc cctcgggatc atggtgatag aaatgattga tggcgagccc 180 
ccctacttca atgagcctcc cctccaggcg atgcggagga tccgggacag tttacctcca 240 
agagtgaagg acctacacaa ggtttcttca gtgctccggg gattcctaga cttgatgttg 300 
gtgagggagc cctctcagag agcaacagcc caggaactcc tcggacatcc attcttaaaa 360 
ctagcaggtc caccgtcttg catcgtcccc ctcatgagac aatacaggca tcactga 417 

<210> 13 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 
<400> 13 

gagaccggga acatggcgct 20 

<210> 14 
<211> 22 
<212> DNA

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer
<400> 14 

tcagtgatgc ctgtattgtc tc 22 